The notion of a U-statistic for an n-tuple of identical quantum systems is introduced in
analogy to the classical (commutative) case: given a selfadjoint 'kernel' $K$ acting on $(\mathbb{C}^d)^{\otimes r}$
with $r < n$, we define the symmetric operator $U_n = \binom{n}{r}^{-1} \sum_{\beta} K^{(\beta)}$, with $K^{(\beta)}$ being the kernel
acting on the subset $\beta$ of $\{1, \dots, n\}$. If the systems are prepared in the i.i.d. state $\rho^{\otimes n}$ it is
shown that the sequence of properly normalised U-statistics converges in moments to a linear
combination of Hermite polynomials in canonical variables of a CCR algebra defined through
the Quantum Central Limit Theorem. In the special cases of non-degenerate kernels and kernels
of order 2 it is shown that the convergence holds in the stronger distribution sense.

Two types of applications in quantum statistics are described: testing beyond the two simple 
hypotheses scenario, and quantum metrology with interacting hamiltonians. 

1 Introduction 

Let $X_1, \dots, X_n$ be an i.i.d. sample from a distribution $P$ over $\mathbb{R}$. A U-statistic is a generalisation of
the notion of sample mean

$$\bar{X}_n = \frac{1}{n} \sum_{i=1}^{n} X_i$$

to the case where one averages a (symmetric) kernel $h(x_1, \dots, x_r)$ over all possible ways of choosing r
different variables out of $(X_1, \dots, X_n)$:

$$U_n = \binom{n}{r}^{-1} \sum_{\beta} h(X_{\beta_1}, \dots, X_{\beta_r}).$$

The study of U-statistics is a classic topic in mathematical statistics [30], [33], whose roots go back
to the papers by Hoeffding [TB] and Halmos [TJ]. Numerous examples of such statistics are used
in both estimation and testing, their main advantages being the intuitive character and tractable
asymptotic properties.

In this work we introduce the notion of U-statistic for non-commutative variables appearing in
quantum mechanics, prove an analogue of the 'classical' convergence theorem, and sketch a few
applications in quantum statistics. We take the view that Quantum Statistics is an extension of
classical (commutative) statistics and that techniques from the latter can be generalised and used to
solve quantum statistical problems. This approach proved its fruitfulness right from the beginning of
the field [161 [T9, 3 with the development of quantum Cramér-Rao inequalities and quantum Fisher
information techniques, the analysis of group covariant models, and more recently the (asymptotic)
solutions of the optimal state discrimination problem |TTJ[24l[l], the optimal state estimation problem
[21 HU [HI [20], and the theory of quantum sufficiency [26].

Our study of quantum U-statistics is motivated by their applications in estimation and testing
for large samples of identically prepared, independent quantum systems. Consider n such systems
prepared in the unknown state $\rho \in M(\mathbb{C}^d)$ and suppose that we are interested in a certain (non-linear)
functional $\theta = f(\rho)$ which has the form

$$\theta = \mathrm{Tr}(\rho^{\otimes r} K)$$

where $K$ is a selfadjoint operator on $(\mathbb{C}^d)^{\otimes r}$ which can be chosen to be symmetric under permutations.
Then by measuring $K$ we obtain an unbiased estimator of $\theta$, but in this way we use only r of
the n systems. The linear combination of all such kernels acting on different tuples of systems
has the same basic property but has the advantage of a smaller variance

$$U_n = \binom{n}{r}^{-1} \sum_{\beta} K^{(\beta)}. \qquad (1.1)$$
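As an illustration of the definition, the following is a minimal numerical sketch, assuming NumPy and small dimensions. The helper names `embed` and `quantum_u_statistic`, the example state, and the choice of kernel $K = (\mathbf{1} - \mathrm{SWAP})/2$ (for which $\theta = (1 - \mathrm{Tr}\,\rho^2)/2$) are illustrative assumptions rather than constructions taken from the text.

```python
import itertools
import numpy as np

def embed(kernel, beta, n, d):
    """Matrix on (C^d)^{⊗n} of `kernel` acting on the sites listed in `beta`."""
    r = len(beta)
    # Kernel on the first r slots, identity on the remaining n - r slots.
    full = np.kron(kernel, np.eye(d ** (n - r))).reshape((d,) * (2 * n))
    # Slot k acts on site perm[k]; invert to find the slot serving each site.
    perm = list(beta) + [s for s in range(n) if s not in beta]
    inv = [perm.index(s) for s in range(n)]
    axes = inv + [n + k for k in inv]
    return full.transpose(axes).reshape(d ** n, d ** n)

def quantum_u_statistic(kernel, r, n, d):
    """Average of the kernel over all subsets of r sites out of n."""
    subsets = list(itertools.combinations(range(n), r))
    return sum(embed(kernel, b, n, d) for b in subsets) / len(subsets)

d, n, r = 2, 3, 2
# SWAP|i,j> = |j,i> on C^d ⊗ C^d.
swap = np.eye(d * d).reshape(d, d, d, d).transpose(1, 0, 2, 3).reshape(d * d, d * d)
kernel = (np.eye(d * d) - swap) / 2          # illustrative symmetric kernel
rho = np.diag([0.7, 0.3])                    # illustrative state
rho_n = np.kron(np.kron(rho, rho), rho)

u_n = quantum_u_statistic(kernel, r, n, d)
theta = np.trace(np.kron(rho, rho) @ kernel).real
mean_u = np.trace(rho_n @ u_n).real          # expectation of U_n in rho^{⊗n}
```

In this example `mean_u` agrees with `theta`, reflecting the unbiasedness of $U_n$; the brute-force construction is only practical for a handful of sites since the matrices have size $d^n$.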

Another application is parameter estimation in quantum metrology [51 [71 [2"5]. Assume that $H_n := \binom{n}{r} U_n$
is a multi-body hamiltonian with r-body interactions, and let $\psi_\gamma^n := \exp(it\gamma H_n)\psi^{\otimes n}$ be the
evolved state at time $t$ starting from a product state. In [TJ it is shown that $\gamma$ can be estimated
with precision $n^{-r+1/2}$ by means of separate measurements on the n systems. Already for $r = 2$
this rate is faster than the famous 'Heisenberg limit' $n^{-1}$, which is the absolute estimation rate for
non-interacting hamiltonians.

Although definition (1.1) is identical to that of classical U-statistics, the existing theory cannot be
applied since in general the terms of the sum do not commute with each other. However, guided
by the classical technique called the Hoeffding decomposition we show that $U_n$ can be written as
a polynomial in fluctuation observables $\mathbb{F}_n(A) := \frac{1}{\sqrt{n}} \sum_{i=1}^{n} A^{(i)}$ and empirical averages
$\mathbb{P}_n(B) := \frac{1}{n} \sum_{i=1}^{n} B^{(i)}$. By applying a version of the Quantum Central Limit Theorem [25] we obtain the main
result, the convergence in moments Theorem 6.7. This says that asymptotically with n, the moments
of the rescaled U-statistic $n^{c/2} U_n$ converge to moments of a certain polynomial in quantum canonical
variables of a CCR algebra with respect to a Gaussian state. The integer $1 \le c \le r$ is a degree of
degeneracy and depends on the statistics of $K$ with respect to the state $\rho^{\otimes r}$. In important special
cases, such as order 2 kernels and non-degenerate kernels, this can be strengthened to convergence in
distribution. In particular, in the latter case, the asymptotic distribution is normal, similar to the
Central Limit Theorem for fluctuations of collective observables.

The paper is organised as follows. In section 2 we give a short review of the classical theory of
U-statistics culminating with the convergence Theorem 2.7. Section 3 deals with standard notions on
Gaussian states of harmonic oscillators and ends with Theorem 3.2 which will be needed in the main
result. The theorem shows that the symmetric products of Hermite polynomials in the canonical
variables are orthogonal to the lower order polynomials, with respect to the inner product defined
by a thermal equilibrium state. Section 4 formulates the Quantum Central Limit Theorem [25] for
d-dimensional systems and gives an explicit description of the canonical commutation relations (CCR)
algebra emerging in the limit, as a product of $d(d-1)/2$ harmonic oscillators in thermal states and
a $(d-1)$-dimensional classical Gaussian variable.

In section 6 we introduce the notion of quantum U-statistic and analyse its asymptotic behaviour
by using the Hoeffding decomposition for $L^2$-spaces of non-commuting variables derived in section
5. The main result of the paper is Theorem 6.7 which shows that properly normalised U-statistics
converge in moments to a certain polynomial in the canonical variables of the CCR algebra. In
section 8 we discuss two possible applications of the theory. The first one is the construction of
quantum tests for various non-standard discrimination problems such as distinguishing between a
given state and everything else. The second application is in quantum metrology where symmetric
r-body interaction hamiltonians can be used to beat the $n^{-1}$ Heisenberg limit in estimation precision.
These applications will be investigated in more detail elsewhere.

2 Classical U-statistics 

In this section we give a short introduction to the theory of U-statistics [33]. Throughout, $X_1, \dots, X_n$
will be a random sample from an unknown distribution $P$ over the measure space $(\Omega, \Sigma)$. We consider
the problem of estimating the parameter

$$\theta = \mathbb{E}(h(X_1, \dots, X_r)),$$

where $h$ is a known function which can be taken to be symmetric with respect to permutations of
the arguments. By definition $h(X_1, \dots, X_r)$ is an unbiased estimator of $\theta$ but uses only the first r
variables, so we would like to replace it by a better estimator based on the whole sample, where
$n > r$.

Definition 2.1. Let $X_1, \dots, X_n$ be a random sample from an unknown distribution. Let $h(x_1, \dots, x_r)$
be a kernel which is invariant under permutations of the arguments. The U-statistic with kernel h
is defined as

$$U_n = \binom{n}{r}^{-1} \sum_{\beta} h(X_{\beta_1}, \dots, X_{\beta_r}),$$

where the sum is taken over all unordered subsets $\beta = \{\beta_1, \dots, \beta_r\}$ of r distinct integers from $\{1, \dots, n\}$.

Since the observations are i.i.d., $U_n$ is an unbiased estimator of $\theta$ and has smaller variance than
$h(X_1, \dots, X_r)$. In fact, for non-degenerate kernels $\sqrt{n}(U_n - \theta)$ is asymptotically normal.
Theorem 2.2. Let $h(X_1, \dots, X_r)$ be a symmetric kernel with $\mathbb{E}h^2(X_1, \dots, X_r) < \infty$. Let

$$h_1(x) := \mathbb{E}(h(x, X_2, \dots, X_r)) - \theta$$

and suppose that $\xi_1 = \mathbb{E}h_1^2(X_1) \neq 0$. Then

$$\sqrt{n}(U_n - \theta) \xrightarrow{\mathcal{D}} N(0, r^2 \xi_1).$$

Example 2.3. A U-statistic of degree $r = 1$ is a mean $\frac{1}{n} \sum_{i=1}^{n} h(X_i)$ and the above statement is
the Central Limit Theorem.

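Example 2.4. Let $h(x_1, x_2) = (x_1 - x_2)^2/2$ be a kernel of degree $r = 2$. In this case

$$\theta = \frac{1}{2}\mathbb{E}(X_1 - X_2)^2 = \mathbb{E}(X^2) - (\mathbb{E}(X))^2 = \mathrm{Var}(X).$$

The corresponding U-statistic is the sample variance

$$U_n = \binom{n}{2}^{-1} \sum_{i<j} \frac{(X_i - X_j)^2}{2} = \frac{1}{n-1} \sum_{i=1}^{n} (X_i - \bar{X})^2,$$

where $\bar{X} := \frac{1}{n} \sum_{i=1}^{n} X_i$. We have $h_1(x) = \frac{1}{2}[(x - \mathbb{E}(X))^2 - \mathrm{Var}(X)]$ and $\xi_1 = \mathrm{Var}((X - \mathbb{E}(X))^2)/4 \neq 0$
unless $X - \mathbb{E}(X)$ has a distribution which is concentrated on two points $\{a, -a\}$.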
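The identity between the U-statistic with kernel $(x_1 - x_2)^2/2$ and the unbiased sample variance can be checked on a small data set. The sketch below uses only the Python standard library; the function name `u_statistic` and the sample values are illustrative assumptions.

```python
import itertools
import statistics

def u_statistic(sample, kernel, r):
    """Average a symmetric kernel over all unordered r-subsets of the sample."""
    subsets = list(itertools.combinations(sample, r))
    return sum(kernel(*xs) for xs in subsets) / len(subsets)

def variance_kernel(x1, x2):
    return (x1 - x2) ** 2 / 2

sample = [2.0, 4.0, 4.0, 5.0, 7.0, 9.0]      # arbitrary illustrative data

u_n = u_statistic(sample, variance_kernel, 2)
s2 = statistics.variance(sample)             # unbiased sample variance (n - 1)
mean_as_u = u_statistic(sample, lambda x: x, 1)
```

Here `u_n` and `s2` coincide up to rounding, and the degree 1 case `mean_as_u` reduces to the sample mean, as in Example 2.3.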

If $\xi_1 = 0$ then the convergence to the normal distribution does not hold, and one calls the kernel h
degenerate. In such cases the limit distribution is that of a polynomial in Gaussian variables. The
proof uses an elegant tool called the Hoeffding decomposition which we describe below.

Let $L^2(\Omega^n, \Sigma^n, P^n)$ be the space of square integrable functions of $(X_1, \dots, X_n)$. For every subset
$B \subset \{1, \dots, n\}$ we identify the subspace $L_B := L^2(\Omega^B, \Sigma^B, P^B)$ consisting of functions depending
only on the variables $(X_i : i \in B)$. The projection $Q_B$ onto this subspace is simply the conditional
expectation

$$Q_B f = \mathbb{E}(f \mid X_i : i \in B).$$

By construction, these subspaces are not mutually orthogonal but respect the partial order defined
by set inclusion

$$B' \subset B \implies L_{B'} \subset L_B.$$

In order to compute covariances, it is more convenient to associate to each subset A a subspace
$\mathcal{H}_A$ which contains only those vectors in $L_A$ which are orthogonal to all smaller subspaces $L_B$.
Thus $\mathcal{H}_A$ consists of functions $g(X_i : i \in A)$ such that

$$\mathbb{E}(g(X_i : i \in A) \mid X_j : j \in B) = 0, \quad \text{for all } B : |B| < |A|.$$

Here are the simplest examples of how the projection $P_A$ onto $\mathcal{H}_A$ acts:

$$P_\emptyset f = \mathbb{E}(f),$$
$$P_{\{i\}} f = \mathbb{E}(f \mid X_i) - \mathbb{E}(f),$$
$$P_{\{i,j\}} f = \mathbb{E}(f \mid X_i, X_j) - \mathbb{E}(f \mid X_i) - \mathbb{E}(f \mid X_j) + \mathbb{E}(f).$$

Theorem 2.5 (Hoeffding decomposition). Let $f \in L^2(\Omega^n, \Sigma^n, P^n)$. Then the projection of f onto
$\mathcal{H}_A$ is given by

$$P_A f = \sum_{B \subset A} (-1)^{|A| - |B|}\, \mathbb{E}(f \mid X_i : i \in B).$$

The spaces $\mathcal{H}_A$ are mutually orthogonal and span $L^2(\Omega^n, \Sigma^n, P^n)$.

A simple way to visualise the subspaces $\mathcal{H}_A$ is to construct an orthonormal basis. Let $1 = f_0, f_1, \dots$
be an orthonormal basis of $L^2(\Omega, \Sigma, P)$. Then $\mathcal{H}_A$ is the span of basis vectors of the form

$$f_{i_1} \otimes \cdots \otimes f_{i_n}$$

for which $i_k \geq 1$ if and only if $k \in A$.

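We now apply the Hoeffding decomposition to U-statistics. A combinatorial argument shows that

$$U_n = \binom{n}{r}^{-1} \sum_{\beta} \sum_{l=0}^{r} \sum_{A \subset \beta,\, |A| = l} P_A h(X_{\beta_1}, \dots, X_{\beta_r}) = \sum_{l=0}^{r} \binom{r}{l} U_{n,l},$$

where $U_{n,l}$ is the U-statistic corresponding to the symmetric kernel $h_l$ of order l, equal to the
projection

$$h_l := P_{\{1, \dots, l\}} h. \qquad (2.1)$$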
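The combinatorial argument rests on a counting identity: each subset A of size l is contained in $\binom{n-l}{r-l}$ subsets $\beta$ of size r, and $\binom{n-l}{r-l}\binom{n}{l} = \binom{n}{r}\binom{r}{l}$. The short sketch below checks this identity for a range of values; the function name `hoeffding_weight` is an illustrative assumption.

```python
from fractions import Fraction
from math import comb

def hoeffding_weight(n, r, l):
    """Coefficient of the order-l U-statistic in the expansion of U_n."""
    # Each subset A of size l lies in comb(n - l, r - l) subsets beta of size r,
    # and the sum of h_l over all l-subsets equals comb(n, l) times U_{n,l}.
    return Fraction(comb(n - l, r - l) * comb(n, l), comb(n, r))

weights = [hoeffding_weight(10, 3, l) for l in range(4)]
```

For n = 10 and r = 3 the weights are the binomial coefficients $\binom{3}{l}$, independently of n.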

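Lemma 2.6. Let $h(x_1, \dots, x_r)$ be a kernel of order r and let $U_n$ be the associated U-statistic.
Then

$$\mathrm{Var}(U_n) = \sum_{l=1}^{r} \binom{r}{l}^2 \binom{n}{l}^{-1} \mathbb{E}(h_l^2(X_1, \dots, X_l)).$$

If $h_1 = \cdots = h_{c-1} = 0$ and $h_c \neq 0$ then $\mathrm{Var}(U_n)$ is of order $O(n^{-c})$.

Proof. From the definition of $h_l$ it follows that $U_{n,i}$ and $U_{n,j}$ are not correlated for $i \neq j$. On the
other hand the variance of $U_{n,l}$ can be easily computed as

$$\mathrm{Var}(U_{n,l}) = \binom{n}{l}^{-2} \sum_{\beta, \beta'} \mathbb{E}(h_l(X_{\beta_1}, \dots, X_{\beta_l})\, h_l(X_{\beta'_1}, \dots, X_{\beta'_l})) = \binom{n}{l}^{-1} \mathbb{E}(h_l^2(X_1, \dots, X_l)),$$

which is of order $n^{-l}$. Then

$$\mathrm{Var}(U_n) = \sum_{l=1}^{r} \binom{r}{l}^2 \binom{n}{l}^{-1} \mathbb{E}(h_l^2(X_1, \dots, X_l)).$$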

□ 
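The variance formula of Lemma 2.6 can be verified exactly on a small example. The sketch below assumes X uniform on {0, 1, 2}, the variance kernel of Example 2.4, and n = 3, and compares the variance of $U_n$ obtained by full enumeration with the sum over the Hoeffding components; all of these choices, and the helper names, are illustrative assumptions.

```python
from fractions import Fraction
from itertools import combinations, product
from math import comb

values = [0, 1, 2]                       # X uniform on {0, 1, 2} (assumption)
p = Fraction(1, len(values))

def h(x1, x2):
    return Fraction((x1 - x2) ** 2, 2)

def expect(f, k):
    """Exact expectation of f(X_1, ..., X_k) for i.i.d. uniform X_i."""
    return sum(f(*xs) for xs in product(values, repeat=k)) * p ** k

theta = expect(h, 2)

def h1(x):
    # First Hoeffding component: E(h(x, X_2)) - theta.
    return expect(lambda y: h(x, y), 1) - theta

def h2(x1, x2):
    # Second Hoeffding component: what is left after removing h1 and theta.
    return h(x1, x2) - h1(x1) - h1(x2) - theta

def u_stat(xs):
    pairs = list(combinations(xs, 2))
    return sum(h(a, b) for a, b in pairs) / len(pairs)

n, r = 3, 2
var_direct = expect(lambda *xs: (u_stat(xs) - theta) ** 2, n)
components = {1: h1, 2: h2}
var_formula = sum(
    Fraction(comb(r, l) ** 2, comb(n, l)) * expect(lambda *xs, f=f: f(*xs) ** 2, l)
    for l, f in components.items()
)
```

Both computations use exact rational arithmetic, so the comparison is not affected by rounding.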

This lemma shows that the dominant contribution to $U_n$ comes from $U_{n,c}$ and suggests that $n^{c/2} U_n$
may have a limit distribution as in the non-degenerate case. We will then formulate the limit theorem
for the U-statistic associated to the kernel $h_c$ defined as above.






Theorem 2.7. Let $h_c(X_1, \dots, X_c)$ be a symmetric kernel such that $\mathbb{E}h_c^2(X_1, \dots, X_c) < \infty$ and
$\mathbb{E}(h_c(x_1, \dots, x_{c-1}, X_c)) = 0$. Let $1 = f_0, f_1, \dots$ be an orthonormal basis of $L^2(\Omega, \Sigma, P)$. Then the
sequence of U-statistics $U_{n,c}$ with kernel $h_c$ satisfies

$$n^{c/2}\, U_{n,c} \xrightarrow{\mathcal{D}} \sum_{k = (k_1, \dots, k_c) \in \mathbb{N}^c} \langle h_c, f_{k_1} \otimes \cdots \otimes f_{k_c} \rangle \prod_{j} H_{a_j(k)}(\mathbb{G}(f_j)).$$

Here $H_a$ are the Hermite polynomials, $\mathbb{G}(f_j)$ are independent standard normal variables and $a_j(k)$
is the number of times $f_j$ occurs among $f_{k_1}, \dots, f_{k_c}$. The variance of the limit variable is equal to
$c!\, \mathbb{E}h_c^2(X_1, \dots, X_c)$.



3 Thermal equilibrium states of a harmonic oscillator 

In the classical case, the limit of U-statistics is described in terms of polynomials in normally
distributed variables. The simplest quantum analogue is a quasifree (Gaussian) state of a quantum
harmonic oscillator. The first part of this section gives a short overview of the basic notions needed
in the main result and for the formulation of the Quantum Central Limit Theorem.

In the second part we prove a technical result which will be employed in the convergence theorem of
U-statistics: the symmetric product of Hermite polynomials in the canonical variables is orthogonal
to all lower order polynomials, where the inner product is given by the covariance with respect to a
thermal equilibrium state.

A variety of quantum systems such as the free quantum particle, the quantum harmonic oscillator,
and the monochromatic light beam are described mathematically by the same algebra of observables
generated by the 'canonical coordinates' Q and P satisfying Heisenberg's commutation relations

$$[Q, P] := QP - PQ = i\mathbf{1}. \qquad (3.1)$$

These observables can be represented as (unbounded) selfadjoint operators on $L^2(\mathbb{R})$

$$(Q\psi)(q) = q\,\psi(q), \qquad (P\psi)(q) = \frac{1}{i} \frac{d\psi}{dq}(q),$$

where $\psi$ is a vector in the domain of the respective observable. The space $L^2(\mathbb{R})$ has a special
orthonormal basis $\{|0\rangle, |1\rangle, \dots\}$ with the vector $|m\rangle$ given by

$$H_m(q)\, e^{-q^2/2} / (\sqrt{\pi}\, 2^m m!)^{1/2},$$

where $H_m$ are the Hermite polynomials [9]. These are the eigenvectors of the number operator
$N := \frac{1}{2}(Q^2 + P^2 - \mathbf{1})$ counting the number of 'excitations' of the oscillator or the number of photons
in the case of the light beam, such that $N|m\rangle = m|m\rangle$.

In order to avoid technicalities related to the fact that Q and P are unbounded operators, one can
work instead with the unitary operators $U(a) := \exp(iaQ)$ and $V(b) := \exp(ibP)$ and encode (3.1)
into the following relation

$$U(a)V(b) = \exp(iab)\, V(b)U(a).$$

By combining the two families of unitaries into a single Weyl operator

$$S(a, b) := \exp\left(-\tfrac{i}{2}ab\right) U(a)V(b), \qquad (a, b) \in \mathbb{R}^2,$$

we obtain a projective unitary representation of the abelian group $\mathbb{R}^2$

$$S(a, b)S(c, d) = \exp\left(\tfrac{i}{2}(ad - bc)\right) S(a + c, b + d). \qquad (3.2)$$

Since the algebra generated by the Weyl operators is dense in $B(L^2(\mathbb{R}))$ with respect to the weak
topology, we will refer to the latter as the algebra of observables of the harmonic oscillator.

The pure state $|0\rangle\langle 0|$ is called the vacuum or zero temperature state and satisfies

$$\langle 0|S(a, b)|0\rangle = \exp\left(-\tfrac{1}{4}(a^2 + b^2)\right).$$

From this it follows that in the vacuum both Q and P have centred normal distributions known
as 'vacuum fluctuations', with variances $\langle 0|Q^2|0\rangle = \langle 0|P^2|0\rangle = 1/2$. Their product is equal to the
minimum of 1/4 allowed by Heisenberg's uncertainty principle.

Besides the vacuum, we will be interested in thermal equilibrium states

$$\phi_\sigma(S(a, b)) = \exp\left(-\tfrac{\sigma^2}{2}(a^2 + b^2)\right), \qquad \sigma^2 \geq \tfrac{1}{2}, \qquad (3.3)$$

where the variances of Q, P depend on the inverse temperature $\beta$ as $\sigma^2 = (2\tanh(\beta/2))^{-1}$. Thermal
equilibrium states are mixed and their density matrix $\phi_\sigma$ is diagonal in the number operator basis

$$\phi_\sigma = (1 - e^{-\beta}) \sum_{k=0}^{\infty} e^{-k\beta}\, |k\rangle\langle k|. \qquad (3.4)$$
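The relation between the inverse temperature and the variance of the canonical variables can be checked numerically in a truncated number basis. The sketch below assumes NumPy, a cut-off dimension of 80 levels, and the standard matrix representation $Q = (a + a^\dagger)/\sqrt{2}$, $P = (a - a^\dagger)/(i\sqrt{2})$ in terms of the annihilation operator; the function name and cut-off are illustrative assumptions.

```python
import numpy as np

def thermal_variances(beta, dim=80):
    """Variances of Q and P in the thermal state, and the predicted value."""
    k = np.arange(dim)
    a = np.diag(np.sqrt(k[1:]), 1)               # truncated annihilation operator
    Q = (a + a.T) / np.sqrt(2)
    P = (a - a.T) / (1j * np.sqrt(2))
    # Density matrix diagonal in the number basis with geometric weights.
    phi = (1 - np.exp(-beta)) * np.diag(np.exp(-beta * k))
    var_q = np.trace(phi @ Q @ Q).real
    var_p = np.trace(phi @ P @ P).real
    predicted = 1 / (2 * np.tanh(beta / 2))
    return var_q, var_p, predicted

var_q, var_p, predicted = thermal_variances(beta=1.0)
cold_q, cold_p, cold_predicted = thermal_variances(beta=20.0)
```

The truncation error is of the order of the Boltzmann weight of the highest retained level, which is negligible for the values of $\beta$ used here; for large $\beta$ the variances approach the vacuum value 1/2.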

Both vacuum and thermal states are particular examples of quasifree states, the latter being
characterised by the fact that all linear combinations of canonical variables have Gaussian distributions.
Any quasifree state can be obtained by applying a sequence of unitary operations called displacement,
phase transformation and squeezing to a thermal or vacuum state [2"12].

Since the Hermite polynomials are orthogonal with respect to the normal distribution $N(0, 1/2)$ we
have

$$\langle 0|H_n(Q)H_m(Q)|0\rangle = \frac{1}{\sqrt{\pi}} \int H_n(x)H_m(x)\, e^{-x^2} dx = \delta_{n,m}\, 2^m m!. \qquad (3.5)$$
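The orthogonality relation (3.5) can be checked numerically with Gauss-Hermite quadrature, which integrates polynomials against the weight $e^{-x^2}$ exactly up to a degree determined by the number of nodes. The sketch below assumes NumPy; the function name `hermite_inner` and the number of nodes are illustrative choices.

```python
from math import factorial, pi, sqrt

import numpy as np
from numpy.polynomial.hermite import hermgauss, hermval

def hermite_inner(n, m, nodes=40):
    """(1/sqrt(pi)) * integral of H_n(x) H_m(x) exp(-x^2) dx by quadrature."""
    x, w = hermgauss(nodes)                  # nodes and weights for exp(-x^2)
    cn = [0] * n + [1]                       # coefficient vector selecting H_n
    cm = [0] * m + [1]                       # coefficient vector selecting H_m
    return float(np.sum(w * hermval(x, cn) * hermval(x, cm)) / sqrt(pi))

diagonal = hermite_inner(3, 3)               # expected 2**3 * 3!
off_diagonal = hermite_inner(2, 4)           # expected 0
```

Here `hermval` evaluates series in the physicists' Hermite polynomials, which is the normalisation matching $\delta_{n,m} 2^m m!$.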

Let $\phi_\sigma$ be the thermal equilibrium state defined in (3.3) and (3.4), and let $L^2(\phi_\sigma)$ be the completion
of $B(L^2(\mathbb{R}))$ with respect to the complex inner product

$$\langle A, B \rangle_{\phi_\sigma} = \mathrm{Tr}(\phi_\sigma A^* B). \qquad (3.6)$$

By a change of variable (3.5) can be rewritten as

$$\langle H_n(Q/(\sqrt{2}\sigma)), H_m(Q/(\sqrt{2}\sigma)) \rangle_{\phi_\sigma} = \delta_{n,m}\, 2^m m!,$$

and a similar orthogonality relation can be derived for P. Since in the commutative case $H_n(x)H_m(y)$
is a basis in $L^2(\mathbb{R}^2, \exp(-(x^2 + y^2)))$, this suggests that $L^2(\phi_\sigma)$ may have an orthogonal basis consisting
of some type of products of Hermite polynomials in Q and P.
Definition 3.1. Let Q, P be the canonical observables of a quantum harmonic oscillator. The
symmetric ordering on polynomials in Q, P is the operation defined by replacing each monomial
$X_1 \cdots X_n$ by

$$\mathcal{S}[X_1 \cdots X_n] := \frac{1}{n!} \sum_{\tau \in S(n)} X_{\tau(1)} \cdots X_{\tau(n)},$$

where $X_i$ is either Q or P.

The following theorem shows that the symmetric products of Hermite polynomials behave similarly
to their classical counterparts in what concerns orthogonality between polynomials of different orders.
However, for polynomials of the same order it can be directly checked that the orthogonality relation does
not hold.

Theorem 3.2. Let $\langle \cdot, \cdot \rangle_{\phi_\sigma}$ be the inner product (3.6) on $B(L^2(\mathbb{R}))$. Then

$$\langle \mathcal{S}[H_n(Q/(\sqrt{2}\sigma))\, H_m(P/(\sqrt{2}\sigma))],\, Y \rangle_{\phi_\sigma} = 0,$$

for any polynomial Y in Q, P of order strictly smaller than $n + m$.

In particular, $\mathcal{S}[H_n(Q/(\sqrt{2}\sigma))\, H_m(P/(\sqrt{2}\sigma))]$ is the unique projection of $(\sqrt{2}\sigma)^{-n-m} \mathcal{S}[Q^n P^m]$ onto the
orthogonal complement of polynomials of order strictly smaller than $n + m$.

Proof. Considering the analogy with the commutative case, it is natural to use the theory of Wigner
functions for computing the inner product. We briefly review here the necessary background and
refer to [22] for more details.

To any trace-class operator $A \in \mathcal{T}_2(L^2(\mathbb{R}))$ we associate its Wigner function $W_A \in L^2(\mathbb{R}^2)$ defined
as the inverse Fourier transform with respect to both variables of

$$\widetilde{W}_A(u, v) = \mathrm{Tr}(A \exp(-i(uQ + vP))).$$



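1. The map $W : A \mapsto W_A$ is an isomorphism between $\mathcal{T}_2(L^2(\mathbb{R}))$ and $L^2(\mathbb{R}^2)$ and

2. The expectations of symmetric polynomials in Q, P can be computed as

(3.7)

3. The Wigner function of a product AB can be computed as [2"T]

$$W_{AB}(q, p) = \frac{1}{4\sqrt{2\pi}} \iiiint W_A(q', p')\, W_B(q'', p'') \exp\left(-2i\, \mathrm{Det} \begin{pmatrix} 1 & q & p \\ 1 & q' & p' \\ 1 & q'' & p'' \end{pmatrix}\right) dq'\, dp'\, dq''\, dp''. \qquad (3.8)$$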
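Inserting $W_{\phi_\sigma}(q, p) = \exp(-(q^2 + p^2)/2\sigma^2)/2\pi\sigma^2$ and $W_Y(q, p) = Y(q, p)$ into (3.8) we get

$$W_{Y\phi_\sigma}(q, p) = \frac{1}{4\sigma^2 (2\pi)^{3/2}} \iiiint Y(q', p') \exp(-(q''^2 + p''^2)/2\sigma^2) \exp\left(-2i\, \mathrm{Det} \begin{pmatrix} 1 & q & p \\ 1 & q' & p' \\ 1 & q'' & p'' \end{pmatrix}\right) dq'\, dp'\, dq''\, dp''$$

$$= \frac{1}{4\sqrt{2\pi}} \iint Y(q', p') \exp(-2i(qp' - q'p)) \exp(-2\sigma^2((p - p')^2 + (q - q')^2))\, dq'\, dp'.$$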



Any polynomial $Y(q', p')$ of order strictly smaller than $m + n$ can be expressed as a linear combination,
with coefficients $\alpha(a, b, c, d)$ independent of $(q, p, q', p')$, of terms

$$T(q', p') := q^c p^d\, H_a(\sqrt{2}\sigma(q' - q))\, H_b(\sqrt{2}\sigma(p' - p)), \qquad a + b + c + d < m + n.$$

In this case the above integral can be computed explicitly by using the following property of Hermite
polynomials [9]

$$H_k(q)\, e^{-q^2} = (-1)^k \frac{d^k}{dq^k} \exp(-q^2).$$

By a change of variables we have

$$W_{T\phi_\sigma}(q, p) = \frac{q^c p^d}{8\sqrt{2\pi}\sigma^2} \iint H_a(q') H_b(p') \exp(-q'^2 - p'^2) \exp(-2i(qp' - pq')/\sqrt{2}\sigma)\, dq'\, dp'$$

$$= C q^{a+c} p^{b+d} \exp(-(q^2 + p^2)/2\sigma^2),$$

for some constant $C = C(a, b, c, d, \sigma)$. Now using (3.7) we get

$$\mathrm{Tr}(\phi_\sigma\, \mathcal{S}[H_n(Q/(\sqrt{2}\sigma))\, H_m(P/(\sqrt{2}\sigma))]\, Y)$$

$$= 2\pi \iint H_n(q/(\sqrt{2}\sigma))\, H_m(p/(\sqrt{2}\sigma))\, W_{Y\phi_\sigma}(q, p)\, dq\, dp$$

$$= 2\pi \sum \alpha(a, b, c, d)\, C'(a, b, c, d, \sigma) \iint H_n(q) H_m(p)\, p^{b+d} q^{a+c} \exp(-(q^2 + p^2))\, dq\, dp = 0,$$

where in the last line we used the following property of Hermite polynomials

$$\int H_k(q)\, q^c \exp(-q^2)\, dq = 0, \qquad \text{for } c < k.$$

The second statement of the theorem follows from the fact that the leading term of the polynomial
$\mathcal{S}[H_n(Q/(\sqrt{2}\sigma))\, H_m(P/(\sqrt{2}\sigma))]$ is $(\sqrt{2}\sigma)^{-n-m} \mathcal{S}[Q^n P^m]$.

□ 



4 Quantum Central Limit Theorem 

In this section we give a brief review of the quantum Central Limit Theorem (CLT) [25]. Similarly to its
classical counterpart, the CLT shows how certain Gaussian states of quantum harmonic oscillators
emerge as the limit of 'fluctuations' of collective observables of a large number of identically prepared
systems. Let $\rho$ be a faithful state on $M(\mathbb{C}^d)$, i.e. a state of the form

$$A \mapsto \mathrm{Tr}(\rho A), \qquad A \in M(\mathbb{C}^d),$$

with $\rho$ a strictly positive density matrix. To $(M(\mathbb{C}^d), \rho)$ we will associate an algebra of canonical
commutation relations carrying a Gaussian state $\varphi$. By 'diagonalisation', the algebra can be easily
identified with the tensor product of $d(d-1)/2$ quantum harmonic oscillators and a commutative
algebra. The state $\varphi$ factorises as well into a product of thermal equilibrium states and a
$(d-1)$-dimensional Gaussian distribution.

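Definition 4.1. Let $M(\mathbb{C}^d)_{sa}$ be the space of selfadjoint matrices. On $M(\mathbb{C}^d)_{sa}$ we define the
(positive) inner product

$$\langle A, B \rangle_\rho := \mathrm{Tr}(\rho\, A \circ B), \qquad \text{where } A \circ B := \frac{AB + BA}{2},$$

and we denote by $L^2(\rho)$ the Hilbert space $(M(\mathbb{C}^d)_{sa}, \langle \cdot, \cdot \rangle_\rho)$. By $L^2_0(\rho)$ we denote the orthogonal
complement of the identity, i.e. $L^2_0(\rho) = \{A \in L^2(\rho) : \langle A, \mathbf{1} \rangle_\rho = \mathrm{Tr}(\rho A) = 0\}$.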

We further define the symplectic form $\sigma$ on $L^2(\rho)$

$$\sigma(A, B) := \frac{i}{2} \mathrm{Tr}(\rho[A, B]).$$

Roughly speaking, the algebra of canonical commutation relations $CCR(L^2_0(\rho), \sigma)$ is generated by
the canonical variables $\mathbb{G}(A)$ satisfying the conditions

$$\mathbb{G}(A + B) = \mathbb{G}(A) + \mathbb{G}(B),$$

$$\mathbb{G}(A) = \mathbb{G}(A)^*,$$

$$\mathbb{G}(\lambda A) = \lambda\, \mathbb{G}(A), \qquad \lambda \in \mathbb{R},$$

$$[\mathbb{G}(A), \mathbb{G}(B)] = 2i\sigma(A, B)\mathbf{1},$$

for all $A, B \in L^2_0(\rho)$. The harmonic oscillator is a particular case where the space of canonical
variables is spanned by Q and P satisfying the commutation relation $[Q, P] = i\mathbf{1}$.

Alternatively, one can define the unitary Weyl operators $S(A) := \exp(i\mathbb{G}(A))$ which satisfy

$$S(A)^* = S(-A), \qquad S(A)S(B) = S(A + B) \exp(-i\sigma(A, B)), \qquad A, B \in L^2_0(\rho).$$

On $CCR(L^2_0(\rho), \sigma)$ we define the quasifree state

$$\varphi(S(A)) := \exp\left(-\tfrac{1}{2}\|A\|_\rho^2\right), \qquad \|A\|_\rho^2 = \langle A, A \rangle_\rho, \qquad (4.1)$$

with respect to which $\mathbb{G}(A)$ has normal distribution $N(0, \|A\|_\rho^2)$.

We will now 'diagonalise' $CCR(L^2_0(\rho), \sigma)$ and construct an explicit Hilbert space representation of
the algebra and the state $\varphi$. Let

$$\rho = \sum_{i=1}^{d} \mu_i\, |e_i\rangle\langle e_i|, \qquad (4.2)$$

be the spectral decomposition of the density matrix $\rho$ and assume that $\mu_1 > \cdots > \mu_d$. The Hilbert
space $L^2_0(\rho)$ decomposes as a direct sum of orthogonal subspaces $\mathcal{H}_\rho \oplus \mathcal{H}_\rho^\perp$ where

$$\mathcal{H}_\rho := \mathrm{Lin}\{A \in L^2_0(\rho) : [A, \rho] = 0\}, \qquad \text{and} \qquad \mathcal{H}_\rho^\perp = \mathrm{Lin}\{T_{j,k},\ 1 \leq j \neq k \leq d\}, \qquad (4.3)$$

where $T_{j,k}$ are the matrices

$$T_{j,k} = iE_{j,k} - iE_{k,j} \quad \text{for } 1 \leq j < k \leq d,$$

$$T_{k,j} = E_{j,k} + E_{k,j} \quad \text{for } 1 \leq j < k \leq d, \qquad (4.4)$$

with $E_{i,j}$ the matrix with the $(i, j)$ entry equal to 1, and all others equal to 0.

The elements $S(A)$ with $A \in \mathcal{H}_\rho$ generate the center of the CCR algebra, which is isomorphic to
the algebra of bounded continuous functions $C_b(\mathbb{R}^{d-1})$. Explicitly, we identify the coordinates in
$\mathbb{R}^{d-1}$ with the basis $\{d_i := -\mu_i \mathbf{1} + E_{i,i} : i = 1, \dots, d-1\}$ of $\mathcal{H}_\rho$. Then the covariance matrix of the
'classical' Gaussian variables $G_i := \mathbb{G}(d_i)$ is

$$\langle d_i, d_j \rangle_\rho = \mathrm{Tr}(\rho\, d_i d_j) = \delta_{ij}\mu_i - \mu_i\mu_j := [V(\mu)]_{ij}.$$

As for the space $\mathcal{H}_\rho^\perp$, we identify the basis

$$t_{j,k} := T_{j,k} / \sqrt{2|\mu_j - \mu_k|}, \qquad j \neq k, \qquad (4.5)$$

which is both orthogonal and symplectic, i.e.

$$\sigma(t_{j,k}, t_{k,j}) = -1/2, \quad j < k, \qquad \text{and} \qquad \sigma(t_{j,k}, t_{l,m}) = 0 \quad \text{for } \{j, k\} \neq \{l, m\}.$$

This means that for each pair $j < k$ the operators

$$Q_{j,k} := \mathbb{G}(t_{j,k}), \qquad P_{j,k} := \mathbb{G}(t_{k,j})$$

form a pair of canonical coordinates of a harmonic oscillator, and all such $d(d-1)/2$ oscillators
commute with each other. Moreover, since the basis is orthogonal, the state $\varphi$ factorises into a
product of thermal equilibrium states $\phi_{\sigma_{j,k}}$ with variances

$$\sigma_{j,k}^2 = \varphi(Q_{j,k}^2) = \varphi(P_{j,k}^2) = \|t_{j,k}\|_\rho^2 = \mathrm{Tr}(\rho\, t_{j,k}^2) = \frac{\mu_j + \mu_k}{2|\mu_j - \mu_k|} > \frac{1}{2}. \qquad (4.6)$$
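The algebraic properties of the basis elements can be verified numerically for a concrete spectrum. The sketch below assumes NumPy, a three-level system with eigenvalues (0.5, 0.3, 0.2), and zero-based indices; the helper names `unit`, `sigma`, `inner`, `t` and `dvec` are illustrative and simply transcribe the symplectic form, the inner product of Definition 4.1, the normalised off-diagonal matrices and the diagonal basis elements.

```python
import numpy as np

mu = np.array([0.5, 0.3, 0.2])               # example spectrum (an assumption)
d = len(mu)
rho = np.diag(mu).astype(complex)

def unit(j, k):
    """Matrix unit with a single 1 in position (j, k)."""
    E = np.zeros((d, d), dtype=complex)
    E[j, k] = 1
    return E

def sigma(A, B):
    """Symplectic form (i/2) Tr(rho [A, B])."""
    return (0.5j * np.trace(rho @ (A @ B - B @ A))).real

def inner(A, B):
    """Inner product Tr(rho (AB + BA)/2)."""
    return np.trace(rho @ (A @ B + B @ A)).real / 2

def t(j, k):
    """Normalised off-diagonal basis element; 'i-type' for j < k, 'real-type' for j > k."""
    lo, hi = min(j, k), max(j, k)
    if j < k:
        T = 1j * unit(lo, hi) - 1j * unit(hi, lo)
    else:
        T = unit(lo, hi) + unit(hi, lo)
    return T / np.sqrt(2 * abs(mu[j] - mu[k]))

def dvec(i):
    """Diagonal basis element E_ii - mu_i * 1."""
    return unit(i, i) - mu[i] * np.eye(d)

symplectic_pair = sigma(t(0, 1), t(1, 0))
variance_01 = inner(t(0, 1), t(0, 1))
```

For this spectrum `variance_01` equals $(0.5 + 0.3)/(2 \cdot 0.2) = 2$, in agreement with the closed form for the oscillator variances.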

Remark 4.2. The above decomposition assumes that $\rho$ has a non-degenerate spectrum. It is easy to
verify that a similar result holds in the general case, with some of the quantum oscillators being replaced
by bivariate classical normal variables.

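We pass now to the formulation of the Quantum CLT. Consider the tensor product $\bigotimes_{k=1}^{n} M(\mathbb{C}^d)$
which is generated by elements of the form

$$A^{(k)} = \mathbf{1} \otimes \cdots \otimes A \otimes \cdots \otimes \mathbf{1}, \qquad (4.7)$$

with A acting on the k-th position of the Hilbert space tensor product $(\mathbb{C}^d)^{\otimes n}$. We are interested
in the asymptotics as $n \to \infty$ of the joint distribution under the state $\rho^{\otimes n}$, of 'fluctuation' elements
of the form

$$\mathbb{F}_n(A) := \frac{1}{\sqrt{n}} \sum_{k=1}^{n} A^{(k)}. \qquad (4.8)$$

Theorem 4.3. [Quantum CLT] Let $A_1, \dots, A_s \in M(\mathbb{C}^d)_{sa}$ such that $\mathrm{Tr}(\rho A_l) = 0$. Then

$$\lim_{n \to \infty} \mathrm{Tr}\left(\rho^{\otimes n} \prod_{l=1}^{s} \mathbb{F}_n(A_l)\right) = \varphi\left(\prod_{l=1}^{s} \mathbb{G}(A_l)\right),$$

$$\lim_{n \to \infty} \mathrm{Tr}\left(\rho^{\otimes n} \prod_{l=1}^{s} \exp(i\mathbb{F}_n(A_l))\right) = \varphi\left(\prod_{l=1}^{s} \exp(i\mathbb{G}(A_l))\right).$$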
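For small n the moments of fluctuation operators can be computed by brute force. The sketch below assumes NumPy, a qubit state with eigenvalues (0.7, 0.3) and Pauli matrices as centred observables, and takes the fluctuation operator to be the normalised sum $n^{-1/2}\sum_k A^{(k)}$; these choices and the helper names are illustrative assumptions. It shows that the second moment $\mathrm{Tr}(\rho^{\otimes n}\mathbb{F}_n(A)\mathbb{F}_n(B))$ equals $\mathrm{Tr}(\rho AB)$ for every n, while the fourth moment of $\mathbb{F}_n(\sigma_x)$ equals $3 - 2/n$ and approaches the Gaussian value 3.

```python
from functools import reduce

import numpy as np

def fluctuation(A, n):
    """n^{-1/2} times the sum of copies of A acting on each of the n sites."""
    d = A.shape[0]
    total = np.zeros((d ** n, d ** n), dtype=complex)
    for k in range(n):
        factors = [np.eye(d)] * n
        factors[k] = A
        total += reduce(np.kron, factors)
    return total / np.sqrt(n)

def product_state(rho, n):
    return reduce(np.kron, [rho] * n)

rho = np.diag([0.7, 0.3]).astype(complex)    # example state (an assumption)
sx = np.array([[0, 1], [1, 0]], dtype=complex)
sy = np.array([[0, -1j], [1j, 0]])

def moment(ops, n):
    """Expectation of the ordered product of fluctuation operators."""
    F = [fluctuation(A, n) for A in ops]
    return np.trace(product_state(rho, n) @ reduce(np.matmul, F))

second = [moment([sx, sy], n) for n in (1, 2, 3, 4)]
fourth = [moment([sx] * 4, n).real for n in (1, 2, 4, 6)]
```

The matrices have size $2^n$, so this check is limited to a few sites; it is meant only to make the statement of the theorem concrete.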



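For later purposes we need the following corollary which combines the CLT with the Law of Large
Numbers. Let $C \in M(\mathbb{C}^d)_{sa}$ with expectation $c = \mathrm{Tr}(\rho C)$ and define the 'empirical average'

$$\mathbb{P}_n(C) := \frac{1}{n} \sum_{k=1}^{n} C^{(k)} = c\mathbf{1} + \frac{1}{\sqrt{n}} \mathbb{F}_n(C - c\mathbf{1}). \qquad (4.9)$$

Corollary 4.4. Let $A_1, \dots, A_s \in M(\mathbb{C}^d)_{sa}$ such that $\mathrm{Tr}(\rho A_i) = 0$ and let $C_1, \dots, C_l \in M(\mathbb{C}^d)_{sa}$ with
expectations $c_i := \mathrm{Tr}(\rho C_i)$. For any non-commutative polynomial $P[X_1, \dots, X_s, Y_1, \dots, Y_l]$ in the free
variables $X_i, Y_j$ we have

$$\lim_{n \to \infty} \mathrm{Tr}(\rho^{\otimes n} P[\mathbb{F}_n(A_1), \dots, \mathbb{F}_n(A_s), \mathbb{P}_n(C_1), \dots, \mathbb{P}_n(C_l)]) = \varphi(P[\mathbb{G}(A_1), \dots, \mathbb{G}(A_s), c_1\mathbf{1}, \dots, c_l\mathbf{1}]).$$

Proof. Write $\mathbb{P}_n(C)$ as in (4.9) and apply the CLT while noting that any monomial containing at least
one factor $n^{-1/2}\, \mathbb{F}_n(C_i - c_i\mathbf{1})$ converges to zero. □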

Remark 4.5. The quantum CLT provides two convergence results, for moments and for exponentials.
In the commutative case, the two are equivalent and are also equivalent to convergence in
distribution [$]. This follows from Lévy's continuity theorem (convergence in distribution is equivalent
to the convergence of the characteristic functions) and the fact that convergence in distribution is
equivalent to convergence in moments, provided that the limit distribution is completely characterised
by its moments. In the quantum case we cannot talk about a joint distribution of non-commuting
variables and the two results seem to be unrelated.



For later purposes we formulate now the extension of Theorem 3.2 to the case of the algebra
$CCR(L^2_0(\rho))$. We denote by $\{F_1, \dots, F_{d^2-1}\}$ the orthonormal basis of $L^2_0(\rho)$ defined earlier

$$\{d_i : i = 1, \dots, d-1\} \cup \{t_{j,k} : 1 \leq j \neq k \leq d\},$$

arranged in a particular order.

Corollary 4.6. Let $V_c \subset L^2(\varphi)$ be the linear span of polynomials of order up to c in the canonical
variables $\mathbb{G}(F_i)$, and denote by $Z_c$ its orthogonal projection. Then the symmetric product of
(non-commuting) Hermite polynomials

$$\mathcal{S}\left[\prod_i H_{m_i}(\mathbb{G}(F_i))\right]$$

is orthogonal to $V_{c-1}$, for any set $m := \{m_i\}$ of multiplicities with $|m| = c$. In particular,

Proof. The result follows from Theorem 3.2 applied to each of the quantum oscillators $(Q_{j,k}, P_{j,k})$
in $CCR(L^2_0(\rho))$ together with the classical counterpart for the 'classical' canonical variables $G_i$.
Indeed by writing the symmetric operators and the projections in terms of tensor products over the
$d(d-1)/2$ oscillators and $(d-1)$ independent normal variables, we can derive the corollary from
the following lemma.

□

Lemma 4.7. Let

$$\mathcal{H}_a = \bigoplus_i \mathcal{H}_{a,i}, \qquad a = 1, \dots, k,$$

be arbitrary decompositions of Hilbert spaces $\mathcal{H}_a$ into orthogonal subspaces with projections $P_{a,i}$. Let

$$Q_{a,i} = \bigoplus_{b=1}^{i} P_{a,b}$$

be the projections onto vectors in $\mathcal{H}_a$ of 'order less or equal to i'. Similarly let $Z_c$ be the projection
onto the vectors of the tensor product $\mathcal{H}_1 \otimes \cdots \otimes \mathcal{H}_k$ of total order less or equal to c

$$Z_c := \bigoplus P_{1,m_1} \otimes \cdots \otimes P_{k,m_k},$$

where the direct sum runs over all multiplicities $m = \{m_a\}$ with $|m| \leq c$.
Then for any k-tuple $\{\psi_a = Q_{a,m_a}\psi_a : a = 1, \dots, k\}$ with $|m| = c$ we have

$$(\mathbf{1} - Z_{c-1}) \left(\bigotimes_a \psi_a\right) = \bigotimes_a P_{a,m_a}\psi_a = \bigotimes_a (\mathbf{1} - Q_{a,m_a - 1})\psi_a.$$



5 The Hoeffding decomposition 

This section deals with the generalisation of the concept of Hoeffding decomposition to non-commuting 
variables. When seen as a Hilbert space construction, the difference between classical and quantum 
cases disappears and one obtains the same result as the classical one. However we include the proofs 
for the sake of clarity. 

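As before, given $H \in M(\mathbb{C}^d)$ and any $1 \leq i \leq n$ we denote by $H^{(i)}$ the operator on $(\mathbb{C}^d)^{\otimes n}$

$$H^{(i)} := \mathbf{1} \otimes \cdots \otimes \mathbf{1} \otimes H \otimes \mathbf{1} \otimes \cdots \otimes \mathbf{1} \qquad (5.1)$$

with H acting on the i-th term of the tensor product.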

Definition 5.1. Let $\rho^{\otimes n} \in M(\mathbb{C}^d)^{\otimes n}$ be the joint state of n independent identically prepared systems.
Let A be a subset of $\{1, \dots, n\}$. The conditional expectation

$$\mathbb{E}(\cdot \mid A) : M(\mathbb{C}^d)^{\otimes n} \to M(\mathbb{C}^d)^{\otimes n}$$

is the linear extension of the map

$$\mathbb{E}(K_1 \otimes \cdots \otimes K_n \mid A) := \prod_{j \in A^c} \mathrm{Tr}(\rho K_j) \cdot \prod_{i \in A} K_i^{(i)}.$$

In particular we define

$$\mathbb{E}(K \mid \emptyset) := \mathbb{E}(K) = \mathrm{Tr}(\rho^{\otimes n} K).$$

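Lemma 5.2. Let A be a subset of $\{1, \dots, n\}$. Let $L^2(\rho^{\otimes A})$ be the subspace of $L^2(\rho^{\otimes n})$ consisting of
operators acting only on tensors in A

$$L^2(\rho^{\otimes A}) = \mathrm{Lin}\left\{\prod_{i \in A} K_i^{(i)}\right\}$$

and let $Q_A$ be the corresponding orthogonal projection.
Then

$$\mathbb{E}(K \mid A) = Q_A(K), \qquad K \in L^2(\rho^{\otimes n}).$$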

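Remark 5.3. Note that

1. $Q_\emptyset$ is the one dimensional projection onto the identity $\mathbf{1}$ seen as a vector in $L^2(\rho^{\otimes n})$;

2. the map $A \mapsto L^2(\rho^{\otimes A})$ respects the order by inclusion

$$L^2(\rho^{\otimes B}) \subset L^2(\rho^{\otimes A}) \quad \text{iff} \quad B \subset A;$$

3. the map $A \mapsto L^2(\rho^{\otimes A})$ respects intersections

$$L^2(\rho^{\otimes B}) \cap L^2(\rho^{\otimes A}) = L^2(\rho^{\otimes B \cap A}), \qquad \text{or} \qquad Q_A \cdot Q_B = Q_{A \cap B}.$$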

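By the same construction as in the classical case we can define for each A a subspace $\mathcal{H}_A$ of $L^2(\rho^{\otimes A})$
consisting of vectors which are orthogonal to all spaces $\mathcal{H}_B$ with $B \subsetneq A$.

Definition 5.4. Let A be a subset of $\{1, \dots, n\}$. We denote by $\mathcal{H}_A$ the subspace of $L^2(\rho^{\otimes A})$
consisting of operators which satisfy

$$\mathbb{E}(K \mid B) = 0, \qquad \text{for all } B : |B| < |A|.$$

It can be checked directly that the first such projections are

$$P_\emptyset(H) = \mathbb{E}(H)\mathbf{1},$$
$$P_{\{i\}}(H) = \mathbb{E}(H \mid \{i\}) - \mathbb{E}(H)\mathbf{1},$$
$$P_{\{i,j\}}(H) = \mathbb{E}(H \mid \{i,j\}) - \mathbb{E}(H \mid \{i\}) - \mathbb{E}(H \mid \{j\}) + \mathbb{E}(H)\mathbf{1}.$$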

Lemma 5.5. Let $P_A$ be the orthogonal projection onto $\mathcal{H}_A$. Then

$$P_A \cdot P_B = 0, \qquad A \neq B,$$

i.e. the spaces $\{\mathcal{H}_A\}$ are mutually orthogonal.

Proof. Suppose that B does not contain A. Let $H \in \mathcal{H}_A$, then

$$\mathbb{E}(H \mid B) = \mathbb{E}(H \mid A \cap B) = 0,$$

hence $Q_B(H) = 0$. Since $P_B \leq Q_B$ we obtain $P_B(H) = 0$. □
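Theorem 5.6 (Hoeffding decomposition). The sets of projections $\{P_A\}$ and $\{Q_A\}$ are related by

$$P_A = \sum_{B \subset A} (-1)^{|A| - |B|} Q_B \qquad \text{and} \qquad Q_A = \sum_{B \subset A} P_B.$$

Proof. Let us denote $\tilde{P}_A := \sum_{B \subset A} (-1)^{|A| - |B|} Q_B$. Let $C \subsetneq A$. By multiplying the right side of the
first identity by $Q_C$ we get

$$\tilde{P}_A Q_C = \sum_{B \subset A} (-1)^{|A| - |B|} Q_{B \cap C} = \sum_{D \subset C} Q_D \sum_{B \subset A,\ B \cap C = D} (-1)^{|A| - |B|}$$

$$= \sum_{D \subset C} Q_D \sum_{j=0}^{|A| - |C|} \binom{|A| - |C|}{j} (-1)^{|A| - |D| - j} = 0,$$

since the interior sum is equal to zero by the binomial formula. This means that $\mathrm{Ran}(\tilde{P}_A) \subset \mathcal{H}_A$.
Now it suffices to show that for any $H \in L^2(\rho^{\otimes n})$ and $K_A \in \mathcal{H}_A$ we have

$$\langle H - \tilde{P}_A(H), K_A \rangle_{\rho^{\otimes n}} = \langle P_A(H) - \tilde{P}_A(H), K_A \rangle_{\rho^{\otimes n}} = 0.$$

By the definition of $\tilde{P}_A$ we have

$$\langle H - \tilde{P}_A(H), K_A \rangle_{\rho^{\otimes n}} = \langle H - Q_A(H), K_A \rangle_{\rho^{\otimes n}} - \sum_{B \subsetneq A} (-1)^{|A| - |B|} \langle Q_B(H), Q_B(K_A) \rangle_{\rho^{\otimes n}} = 0,$$

since $(\mathbf{1} - Q_A)P_A = 0$ and $Q_B P_A = 0$ for $B \subsetneq A$.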



Using the first identity, we prove the second identity by induction on $r = |A|$. For $A = \emptyset$ we have
$P_A = Q_A$ by definition. Suppose that the assertion is true for all sets of size $0, \dots, r-1$, and let A
be a set of size r. Then from the first identity, and using the induction hypothesis we get

$$Q_A = P_A - \sum_{B \subsetneq A} (-1)^{|A| - |B|} \sum_{C \subset B} P_C$$

$$= P_A - \sum_{C \subsetneq A} P_C \sum_{B} (-1)^{|A| - |B|} = \sum_{C \subset A} P_C.$$

The interior sum in the second line is over $\{B : B \subsetneq A \text{ and } B \supset C\}$ and can be easily computed
using the binomial formula.

□ 
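The relations between the projections $P_A$ and the conditional expectations $Q_A$ can be checked numerically in the smallest non-trivial case. The sketch below assumes NumPy, two qubits labelled 0 and 1, an example state with eigenvalues (0.7, 0.3), and a random test operator; the helper names `cond_exp`, `subsets` and `hoeffding_projection` are illustrative.

```python
from itertools import combinations

import numpy as np

d = 2
rho = np.diag([0.7, 0.3]).astype(complex)    # example state (an assumption)
eye = np.eye(d)

def cond_exp(K, A):
    """Conditional expectation E(K|A) for two sites, A a subset of {0, 1}."""
    T = K.reshape(d, d, d, d)                # indices (row0, row1, col0, col1)
    A = frozenset(A)
    if A == frozenset({0, 1}):
        return K
    if A == frozenset({0}):                  # average site 1 against rho
        return np.kron(np.einsum('iajb,ba->ij', T, rho), eye)
    if A == frozenset({1}):                  # average site 0 against rho
        return np.kron(eye, np.einsum('aibj,ba->ij', T, rho))
    return np.trace(np.kron(rho, rho) @ K) * np.eye(d * d)

def subsets(A):
    A = sorted(A)
    return [frozenset(c) for k in range(len(A) + 1) for c in combinations(A, k)]

def hoeffding_projection(K, A):
    """Inclusion-exclusion sum of conditional expectations over subsets of A."""
    return sum((-1) ** (len(A) - len(B)) * cond_exp(K, B) for B in subsets(A))

rng = np.random.default_rng(0)
K = rng.normal(size=(d * d, d * d)) + 1j * rng.normal(size=(d * d, d * d))
parts = {A: hoeffding_projection(K, A) for A in subsets({0, 1})}
reconstructed = sum(parts.values())
```

The four components add up to K, each is fixed by its own projection, and applying a different projection to any of them gives zero.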
6 Quantum U-statistics

In this section we introduce the notion of quantum U-statistic and prove the main result formulated
in the convergence Theorem 6.7.

A quantum U-statistic is defined in analogy to its classical counterpart. We replace the sample
$X_1, \dots, X_n$ by n independent, identical quantum systems, each of them prepared in a state
$\rho \in M(\mathbb{C}^d)$. The joint state $\rho^{\otimes n}$ is invariant under the action of the symmetric group $S(n)$

$$\rho^{\otimes n} = \pi_d(\tau)\, \rho^{\otimes n}\, \pi_d(\tau)^*, \qquad \tau \in S(n),$$

where $\pi_d(\tau)$ is the natural unitary representation of $S(n)$ on $(\mathbb{C}^d)^{\otimes n}$

$$\pi_d(\tau) : \psi_1 \otimes \cdots \otimes \psi_n \mapsto \psi_{\tau^{-1}(1)} \otimes \cdots \otimes \psi_{\tau^{-1}(n)}.$$

Additionally, $(\mathbb{C}^d)^{\otimes n}$ carries the tensor representation of the special unitary group $SU(d)$

$$\pi_n(u) : \psi_1 \otimes \cdots \otimes \psi_n \mapsto u\psi_1 \otimes \cdots \otimes u\psi_n, \qquad u \in SU(d).$$

These two representations are related by the following duality [TTj.

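Theorem 6.1 (Weyl duality). Let $\pi_n$ and $\pi_d$ be the representations of $SU(d)$ and, respectively,
$S(n)$ on $(\mathbb{C}^d)^{\otimes n}$. Then the representation space decomposes into a direct sum of tensor products
of irreducible representations of $SU(d)$ and $S(n)$ indexed by Young diagrams $\lambda$ with at most $d$ rows and $n$
boxes:

$$(\mathbb{C}^d)^{\otimes n} = \bigoplus_{\lambda} \mathcal{H}_\lambda \otimes \mathcal{K}_\lambda , \qquad
\pi_n = \bigoplus_{\lambda} \pi_\lambda \otimes 1_{\mathcal{K}_\lambda} , \qquad
\pi_d = \bigoplus_{\lambda} 1_{\mathcal{H}_\lambda} \otimes \tilde\pi_\lambda .$$

We will need this result only in as much as saying that the matrix algebras $\mathcal{A}_n := \mathrm{Alg}(\pi_n(SU(d)))$
and $\mathcal{B}_n := \mathrm{Alg}(\pi_d(S(n)))$ generated by the two group representations are each other's commutant:
$\mathcal{A}_n = \mathcal{B}_n'$. Recall that the commutant of a set of operators $\mathcal{M} \subset B(\mathcal{H})$ is defined as

$$\mathcal{M}' = \{N \in B(\mathcal{H}) : [N, M] = 0 \text{ for any } M \in \mathcal{M}\}.$$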
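The commutation between the two representations, and the permutation invariance of $\rho^{\otimes n}$, can be checked numerically on a small example. The sketch below is a minimal illustration in Python with NumPy, assuming $d = 2$, $n = 3$, a randomly drawn state and a randomly drawn unitary rescaled to determinant one; the helper names `perm_operator` and `tensor_power` are hypothetical and not taken from the paper.

```python
import numpy as np
from itertools import permutations


def perm_operator(tau, d):
    """Matrix of pi_d(tau) on (C^d)^{(x)n}, with tau[i] the image of i (0-based).

    It maps psi_0 (x) ... (x) psi_{n-1} to the product whose k-th factor is
    psi_{tau^{-1}(k)}.
    """
    n = len(tau)
    inv = np.argsort(tau)            # inverse permutation
    shape = (d,) * n
    op = np.zeros((d ** n, d ** n))
    for idx in np.ndindex(*shape):
        new = tuple(idx[inv[k]] for k in range(n))
        op[np.ravel_multi_index(new, shape),
           np.ravel_multi_index(idx, shape)] = 1.0
    return op


def tensor_power(a, n):
    """n-fold tensor power of the matrix a."""
    out = np.eye(1)
    for _ in range(n):
        out = np.kron(out, a)
    return out


d, n = 2, 3
rng = np.random.default_rng(0)

x = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
rho = x @ x.conj().T
rho /= np.trace(rho).real                      # random density matrix

u, _ = np.linalg.qr(rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d)))
u = u / np.linalg.det(u) ** (1.0 / d)          # rescale so that det(u) = 1

rho_n, u_n = tensor_power(rho, n), tensor_power(u, n)
perms = [perm_operator(tau, d) for tau in permutations(range(n))]

# Permutation matrices are real, so p.T is the adjoint of p.
state_err = max(np.abs(p @ rho_n @ p.T - rho_n).max() for p in perms)
comm_err = max(np.abs(p @ u_n - u_n @ p).max() for p in perms)

# Action on a product vector for the cyclic permutation tau = (1, 2, 0).
psi = [rng.normal(size=d) for _ in range(n)]
tau = (1, 2, 0)
inv = np.argsort(tau)
lhs = perm_operator(tau, d) @ np.kron(np.kron(psi[0], psi[1]), psi[2])
rhs = np.kron(np.kron(psi[inv[0]], psi[inv[1]]), psi[inv[2]])

print(state_err, comm_err, np.allclose(lhs, rhs))
```

This only illustrates the inclusion of each algebra in the other's commutant; the equality $\mathcal{A}_n = \mathcal{B}_n'$ is the content of the duality theorem and is not tested here.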

The following algebraic lemma will be useful in analysing the asymptotic distribution of U-statistics.

Lemma 6.2. The algebra $\mathcal{A}_n$ of operators which are invariant under permutations is generated by
the collective observables $\bar A := \sum_{i=1}^{n} A^{(i)}$ with $A \in M(\mathbb{C}^d)$. Moreover, any symmetric (permutation
invariant) operator can be written as a 'symmetric polynomial' in collective observables, i.e. a linear
combination of operators of the form

$$S(\bar A_1, \dots, \bar A_k) := \frac{1}{k!} \sum_{\tau\in S(k)} \bar A_{\tau(1)} \cdots \bar A_{\tau(k)} . \qquad (6.1)$$
Proof. Let $\mathcal{A} := \mathrm{Alg}(\pi_n(SU(d)))$ and let $\mathcal{B} := \mathrm{Alg}(\pi_d(S(n)))$ be the matrix algebras generated by the
two group representations, such that $\mathcal{B}' = \mathcal{A}$. Any element of $SU(d)$ can be written as $u = \exp(iA)$
with $A \in M(\mathbb{C}^d)$ selfadjoint. Then $\pi_n(u) = \exp(i\bar A)$ and by differentiation one can conclude that
$\{\bar A : A \in M(\mathbb{C}^d)\}$ generate the algebra $\mathcal{A}$.

The second statement can be proved by induction on $k$. Indeed the map $A \mapsto \bar A$ is a Lie algebra
morphism, so that $[\bar A, \bar B] = \bar C$ where $C := [A, B]$. For $k = 2$

$$S(\bar A_1, \bar A_2) = \tfrac12(\bar A_1 \bar A_2 + \bar A_2 \bar A_1) = \bar A_1 \bar A_2 + \tfrac12 [\bar A_2, \bar A_1] = \bar A_1 \bar A_2 + \tfrac12 \bar A, \qquad A := [A_2, A_1],$$

hence $\bar A_1 \bar A_2$ is a linear combination of symmetric products. Similarly, but using commutators
repeatedly, we get

$$S(\bar A_1, \dots, \bar A_{k+1}) = \bar A_1 \cdots \bar A_{k+1} + R(k)$$
where $R(k)$ is a linear combination of symmetric monomials of order at most $k$.

□ 
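The two facts used in the proof, namely that $A \mapsto \bar A$ preserves commutators and that the symmetric product of two collective observables differs from the ordinary product by a collective observable, are easy to confirm numerically. The following Python sketch assumes $d = 2$, $n = 3$ and two random selfadjoint matrices; `site_op` and `collective` are illustrative helper names introduced here.

```python
import numpy as np


def site_op(a, i, n):
    """A^{(i)}: the matrix a acting on site i (0-based) of n sites."""
    d = a.shape[0]
    out = np.eye(1)
    for k in range(n):
        out = np.kron(out, a if k == i else np.eye(d))
    return out


def collective(a, n):
    """Collective observable: sum of A^{(i)} over all sites."""
    return sum(site_op(a, i, n) for i in range(n))


def comm(x, y):
    return x @ y - y @ x


def rand_herm(d, rng):
    x = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
    return (x + x.conj().T) / 2


d, n = 2, 3
rng = np.random.default_rng(1)
a1, a2 = rand_herm(d, rng), rand_herm(d, rng)
A1, A2 = collective(a1, n), collective(a2, n)

# Lie algebra morphism: [bar A1, bar A2] = bar([A1, A2]).
lie_err = np.abs(comm(A1, A2) - collective(comm(a1, a2), n)).max()

# k = 2 step: S(bar A1, bar A2) = bar A1 bar A2 + (1/2) bar([A2, A1]).
s12 = 0.5 * (A1 @ A2 + A2 @ A1)
sym_err = np.abs(s12 - (A1 @ A2 + 0.5 * collective(comm(a2, a1), n))).max()

print(lie_err, sym_err)
```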

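In analogy to the classical case, a selfadjoint operator $K$ on $(\mathbb{C}^d)^{\otimes r}$ satisfying $K = \pi_d(\tau) K \pi_d(\tau)^*$ for
all $\tau \in S(r)$ (with $\pi_d$ the permutation representation of $S(r)$ on $(\mathbb{C}^d)^{\otimes r}$) will be called a symmetric
kernel of order $r$. By tensoring with the identity operator on $(\mathbb{C}^d)^{\otimes (n-r)}$, the kernel $K$ will be
identified with its ampliation to $(\mathbb{C}^d)^{\otimes n}$, acting as $K \otimes 1 \otimes \cdots \otimes 1$.
Sometimes it is useful to express $K$ as a sum of tensor products

$$K = \sum_{a=1}^{p} K_{a,1} \otimes K_{a,2} \otimes \cdots \otimes K_{a,r}$$

with $K_{a,i}$ selfadjoint operators. Then (the ampliation of) $K$ can be written as

$$K = \sum_{a=1}^{p} \prod_{l=1}^{r} K_{a,l}^{(l)}$$

where we used the notation (5.1). Similarly, for any unordered subset $\beta = \{\beta_1, \dots, \beta_r\}$ of $\{1, \dots, n\}$
we define $K^{(\beta)}$ by

$$K^{(\beta)} := \sum_{a=1}^{p} \prod_{l=1}^{r} K_{a,l}^{(\beta_l)}$$

which is the version of $K$ acting on the tensors on positions $\beta_1, \dots, \beta_r$.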

Definition 6.3. Let $K$ be a symmetric kernel of order $r$. The U-statistic with kernel $K$ is defined
by

$$U_n := \binom{n}{r}^{-1} \sum_{\beta} K^{(\beta)} \qquad (6.2)$$
where the sum is taken over all unordered subsets $\beta$ of $r$ integers from $\{1, \dots, n\}$.

As in the classical case, the U-statistic can be seen as an unbiased estimator of the parameter

$$\theta := \mathrm{Tr}(K \rho^{\otimes r}) = \mathrm{Tr}(\rho^{\otimes n} U_n)$$
which depends on the unknown state $\rho$. The difference with the classical case is that the terms
of the sum (6.2) may not commute with each other, in which case the U-statistic cannot be reduced
to a classical one by simultaneously diagonalising all $K^{(\beta)}$.
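To make the definition concrete, the sketch below builds $U_n$ for a randomly chosen symmetric kernel of order $r = 2$ on $n = 4$ qubits, checks the unbiasedness relation $\mathrm{Tr}(\rho^{\otimes n} U_n) = \mathrm{Tr}(K\rho^{\otimes 2})$, and exhibits two overlapping terms of the sum that fail to commute. The kernel, the state and the helper names (`site_op`, `unit`, `embed_pair`) are assumptions chosen for illustration only.

```python
import numpy as np
from itertools import combinations
from math import comb


def site_op(a, i, n):
    """The matrix a acting on site i (0-based) of n sites."""
    d = a.shape[0]
    out = np.eye(1)
    for k in range(n):
        out = np.kron(out, a if k == i else np.eye(d))
    return out


def unit(r, c, d):
    e = np.zeros((d, d))
    e[r, c] = 1.0
    return e


def embed_pair(k, i, j, n, d):
    """Ampliation K^{(i,j)} of a two-site operator k to sites i < j of n."""
    k4 = k.reshape(d, d, d, d)       # entries <a b| k |c e>
    out = np.zeros((d ** n, d ** n), dtype=complex)
    for a, b, c, e in np.ndindex(d, d, d, d):
        if k4[a, b, c, e] != 0:
            out += k4[a, b, c, e] * (site_op(unit(a, c, d), i, n)
                                     @ site_op(unit(b, e, d), j, n))
    return out


d, n = 2, 4
rng = np.random.default_rng(2)

x = rng.normal(size=(d * d, d * d)) + 1j * rng.normal(size=(d * d, d * d))
k0 = (x + x.conj().T) / 2
swap = np.zeros((d * d, d * d))
for a in range(d):
    for b in range(d):
        swap[a * d + b, b * d + a] = 1.0
K = 0.5 * (k0 + swap @ k0 @ swap)    # selfadjoint, permutation symmetric

y = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
rho = y @ y.conj().T
rho /= np.trace(rho).real

terms = {(i, j): embed_pair(K, i, j, n, d)
         for i, j in combinations(range(n), 2)}
U_n = sum(terms.values()) / comb(n, 2)

rho_n = np.eye(1)
for _ in range(n):
    rho_n = np.kron(rho_n, rho)

theta = np.trace(K @ np.kron(rho, rho)).real
mean_U = np.trace(rho_n @ U_n).real
noncomm = np.abs(terms[(0, 1)] @ terms[(1, 2)]
                 - terms[(1, 2)] @ terms[(0, 1)]).max()

print(theta, mean_U, noncomm)
```

For a generic kernel the commutator of $K^{(1,2)}$ and $K^{(2,3)}$ is non-zero, which is the obstruction to a classical reduction mentioned above.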

We will now use the Hoeffding decomposition discussed in section 5 to analyse the asymptotic
behaviour of $U_n$. Since the variance is a Hilbert space property rather than an algebraic one, it can be
computed in precisely the same way as in the classical case.

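Lemma 6.4. Let $U_n$ be a U-statistic with kernel $K$ of order $r$. Then $U_n$ has the following
decomposition

$$U_n = \sum_{l=0}^{r} \binom{r}{l}\, U_{n,l} \qquad (6.3)$$

where $U_{n,l}$ is the U-statistic of order $l$ with kernel $K_l$ given by the projection $K_l := P_{\{1,\dots,l\}} K$
(so that $U_{n,0} = \mathbb{E}_\rho(K)\mathbf{1}$). Moreover, the terms $U_{n,l}$ are mutually orthogonal in $L^2(\rho^{\otimes n})$ and

$$\mathrm{Var}_\rho(U_n) = \sum_{l=1}^{r} \binom{r}{l}^{2} \binom{n}{l}^{-1} \mathbb{E}_\rho(K_l^2) .$$

If $K_1 = \cdots = K_{c-1} = 0$ and $K_c \neq 0$ then $\mathrm{Var}_\rho(U_n)$ is of order $O(n^{-c})$.
Proof. Using the Hoeffding decomposition we can write

$$U_n = \sum_{A} P_A U_n = \sum_{l=0}^{r} \sum_{|A|=l} \binom{n}{r}^{-1} \binom{n-l}{r-l} K_A , \qquad
\binom{n}{r}^{-1} \binom{n-l}{r-l} = \binom{r}{l} \binom{n}{l}^{-1} ,$$

where the first sum is taken over all subsets $A$ of $\{1, \dots, n\}$ and $K_A$ denotes the kernel $K_{|A|}$ acting on the
positions in $A$. By symmetry all $K_A$ with $|A| = l$ have the same U-statistic $U_{n,l}$. The coefficient
$\binom{r}{l}$ follows from a simple counting argument: each $A$ with $|A| = l$ is contained in $\binom{n-l}{r-l}$ subsets $\beta$ of size $r$.

The proof of the second statement is identical to that of Lemma 2.6. □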
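Since the variance computation is the same as in the classical case, the formula can be checked exactly on a small commutative example. The Python sketch below assumes a three-point distribution and an arbitrarily chosen symmetric kernel $h(x,y) = xy + (x-y)^2$ of order $r = 2$ (both are illustrative choices, not taken from the paper), computes the Hoeffding projections $K_1$, $K_2$, and compares the formula with the variance obtained by enumerating all samples of size $n = 4$.

```python
import numpy as np
from itertools import product, combinations
from math import comb

# Assumed toy distribution and kernel (illustration only).
xs = np.array([0.0, 1.0, 3.0])
ps = np.array([0.2, 0.5, 0.3])
H = xs[:, None] * xs[None, :] + (xs[:, None] - xs[None, :]) ** 2

theta = ps @ H @ ps                             # E h(X, Y)
K1 = H @ ps - theta                             # first projection K_1(x)
K2 = H - K1[:, None] - K1[None, :] - theta      # second projection K_2(x, y)
EK1 = ps @ K1 ** 2
EK2 = ps @ (K2 ** 2) @ ps

n, r = 4, 2
# Var(U_n) = sum_l binom(r, l)^2 binom(n, l)^(-1) E(K_l^2)
formula = (comb(r, 1) ** 2 * EK1 / comb(n, 1)
           + comb(r, 2) ** 2 * EK2 / comb(n, 2))

# Exact variance by enumerating all 3^n samples.
mean, second = 0.0, 0.0
for idx in product(range(len(xs)), repeat=n):
    w = np.prod(ps[list(idx)])
    u = sum(H[idx[i], idx[j]]
            for i, j in combinations(range(n), 2)) / comb(n, 2)
    mean += w * u
    second += w * u * u
var_exact = second - mean ** 2

print(formula, var_exact)
```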

The next Lemma deals with the algebraic structure of the U-statistic. Roughly speaking, a U-statistic
can be expressed as a polynomial in fluctuation observables with (non-commutative) coefficients
which are products of empirical averages. Using this expression one can then easily derive the
convergence of moments of U-statistics by applying Corollary 4.4. The main idea is best conveyed
through an example. If

$$K = \frac{1}{3!} \sum_{\tau\in S(3)} A_{\tau(1)} \otimes A_{\tau(2)} \otimes A_{\tau(3)} ,$$

then

$$\begin{aligned}
\frac{3!}{n^{3/2}} \binom{n}{3} U_n &= \frac{1}{n^{3/2}} \sum_{(\beta_1,\beta_2,\beta_3)} A_1^{(\beta_1)} A_2^{(\beta_2)} A_3^{(\beta_3)} \\
&= S(\mathbb{F}_n(A_1), \mathbb{F}_n(A_2), \mathbb{F}_n(A_3)) - \mathbb{P}_n(A_1 \circ A_2) \circ \mathbb{F}_n(A_3) \\
&\quad - \mathbb{P}_n(A_1 \circ A_3) \circ \mathbb{F}_n(A_2) - \mathbb{P}_n(A_2 \circ A_3) \circ \mathbb{F}_n(A_1) + \frac{2}{\sqrt{n}}\, \mathbb{P}_n(S(A_1, A_2, A_3)) .
\end{aligned}$$

The sum in the first line runs over all triples of different indices $(\beta_1, \beta_2, \beta_3)$ from $\{1, \dots, n\}$ (position
taken into account), and can be obtained by subtracting from the symmetric product of fluctuations
those terms for which either two or three position indices coincide. These terms can again be
written in terms of symmetric products between fluctuation operators $\mathbb{F}_n(A_i)$ and empirical averages
$\mathbb{P}_n(A_i \circ A_j)$, or just $\mathbb{P}_n(S(A_1, A_2, A_3))/\sqrt{n}$.
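The order-three identity can be verified numerically. The sketch below works on $n = 4$ qubits with three random selfadjoint matrices centred with respect to an assumed diagonal state, so that the fluctuation operator reduces to $n^{-1/2}\sum_i A^{(i)}$. It assumes that $\circ$ denotes the symmetrised (Jordan) product $X \circ Y = (XY + YX)/2$ and that $\mathbb{P}_n(A) = n^{-1}\sum_i A^{(i)}$; these conventions come from earlier sections and are stated here as assumptions, and the function names are illustrative.

```python
import numpy as np
from functools import reduce
from itertools import permutations

n, d = 4, 2


def site_op(a, i):
    """The matrix a acting on site i (0-based) of n sites."""
    out = np.eye(1)
    for k in range(n):
        out = np.kron(out, a if k == i else np.eye(d))
    return out


def F(a):
    """Fluctuation operator of a centred observable a."""
    return sum(site_op(a, i) for i in range(n)) / np.sqrt(n)


def P(a):
    """Empirical average of a."""
    return sum(site_op(a, i) for i in range(n)) / n


def jordan(x, y):
    return 0.5 * (x @ y + y @ x)


def sym(*ops):
    """Symmetric product: average of the products over all orderings."""
    perms = list(permutations(ops))
    return sum(reduce(np.matmul, p) for p in perms) / len(perms)


rng = np.random.default_rng(3)
rho = np.diag([0.7, 0.3])            # assumed reference state


def centred_herm():
    x = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
    a = (x + x.conj().T) / 2
    return a - np.trace(rho @ a).real * np.eye(d)


a1, a2, a3 = centred_herm(), centred_herm(), centred_herm()

# First line: sum over ordered triples of distinct positions.
lhs = sum(site_op(a1, b1) @ site_op(a2, b2) @ site_op(a3, b3)
          for b1, b2, b3 in permutations(range(n), 3)) / n ** 1.5

rhs = (sym(F(a1), F(a2), F(a3))
       - jordan(P(jordan(a1, a2)), F(a3))
       - jordan(P(jordan(a1, a3)), F(a2))
       - jordan(P(jordan(a2, a3)), F(a1))
       + 2 / np.sqrt(n) * P(sym(a1, a2, a3)))

err = np.abs(lhs - rhs).max()
print(err)
```

The coefficient $2/\sqrt{n}$ of the last term arises because the three-fold coincidences are subtracted once in the symmetric product but added back three times through the pairwise corrections.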

Definition 6.5. A polynomial of order $l$ in the non-commutative variables $X_1, \dots, X_t$ is called
symmetric if it is of the form

$$P(X_1, \dots, X_t) = \sum_{k=1}^{l} \sum_{\{j_1,\dots,j_k\}} c_{j_1 \dots j_k}\, S(X_{j_1}, \dots, X_{j_k})$$

where

$$S(X_{j_1}, \dots, X_{j_k}) := \frac{1}{k!} \sum_{\tau\in S(k)} X_{j_{\tau(1)}} \cdots X_{j_{\tau(k)}}$$

and the real valued coefficients $c_{j_1 \dots j_k}$ are symmetric functions of the indices, with the interior sum
taken over all subsets $\{j_1, \dots, j_k\}$ of $\{1, \dots, t\}$.

Lemma 6.6. Let $A_1, \dots, A_l \in M(\mathbb{C}^d)_{sa}$ be such that $\mathrm{Tr}(\rho A_i) = 0$ for all $i = 1, \dots, l$, and let $K$ be the
symmetric kernel of order $l$

$$K := \frac{1}{l!} \sum_{\tau\in S(l)} A_{\tau(1)} \otimes \cdots \otimes A_{\tau(l)} .$$

Then there exist symmetric polynomials $P_0, \dots, P_{l-2}$ of order strictly smaller than $l$ such that the
corresponding U-statistic $U_n$ can be expressed as

$$\frac{l!}{n^{l/2}} \binom{n}{l} U_n = S(\mathbb{F}_n(A_1), \dots, \mathbb{F}_n(A_l))
+ \sum_{t=0}^{l-2} n^{-t/2} P_t(\mathbb{F}_n(A_1), \dots, \mathbb{F}_n(A_l), \mathbb{P}_n(B_1), \dots, \mathbb{P}_n(B_p)) \qquad (6.4)$$

where $p$ is a natural number depending only on $l$, and $B_1, \dots, B_p \in M(\mathbb{C}^d)_{sa}$.
Proof. Let us first note that the polynomials have a very simple form for small $l$. If $l = 1$ then
$U_n = \frac{1}{n} \sum_i A_1^{(i)} = n^{-1/2}\, \mathbb{F}_n(A_1)$, so $\frac{1}{\sqrt{n}} \binom{n}{1} U_n = \mathbb{F}_n(A_1)$.

If $l = 2$ then

$$\frac{2}{n} \binom{n}{2} U_n = \frac{1}{n} \sum_{\beta_1 \neq \beta_2} A_1^{(\beta_1)} A_2^{(\beta_2)} = \mathbb{F}_n(A_1) \circ \mathbb{F}_n(A_2) - \mathbb{P}_n(A_1 \circ A_2) .$$

The case $l = 3$ was discussed above. In general

$$\begin{aligned}
\frac{l!}{n^{l/2}} \binom{n}{l} U_n &= \frac{1}{n^{l/2}} \sum_{(\beta_1,\dots,\beta_l)} A_1^{(\beta_1)} \cdots A_l^{(\beta_l)} \\
&= S(\mathbb{F}_n(A_1), \dots, \mathbb{F}_n(A_l))
- \frac{1}{n^{l/2}} \sum_{\mathcal{P}} \sum_{(\gamma_1,\dots,\gamma_{|\mathcal{P}|})} S(A_i : i \in L_1)^{(\gamma_1)} \cdots S(A_i : i \in L_{|\mathcal{P}|})^{(\gamma_{|\mathcal{P}|})}
\end{aligned}$$

where the first sum of the bottom line runs over all partitions $\mathcal{P} = \{L_1, \dots, L_{|\mathcal{P}|}\}$ of
the set $\{1, \dots, l\}$ into subsets, other than the partition into singletons, and the interior sum is taken over all
sets of position indices $(\gamma_1, \dots, \gamma_{|\mathcal{P}|})$ taken from $\{1, \dots, n\}$, which are different from each other. We have
also used the shorthand notation $S(A_i : i \in L_1)$ for the symmetric product of those $A_i$ with indices in $L_1$.

By repeating the procedure a finite number of times we obtain that the left side is equal to a
symmetric polynomial in collective observables whose terms look like

$$\pm\, n^{-l/2}\, S(\bar B_1, \dots, \bar B_k) .$$

The operators $B_1, \dots, B_k \in M(\mathbb{C}^d)_{sa}$ can be written as nested symmetric products depending on
the hierarchy of successively coarser partitions of $\{1, \dots, l\}$ leading to the final partition
$\{M_1, \dots, M_k\}$. For our purposes it suffices to describe the operators associated to subsets of one
and two elements:

$$M_a = \{i\} \implies B_a = A_i , \qquad M_a = \{i, j\} \implies B_a = A_i \circ A_j .$$

We can now distribute the powers of $n$ inside the symmetric product such that we obtain $\mathbb{F}_n(A_i)$ for
singletons and $\mathbb{P}_n(A_i \circ A_j)$ for two-element subsets. If the partition contains subsets $M_a$ of
three or more elements then we use a factor $n^{-1}$ for each of them to obtain $\mathbb{P}_n(B_a)$, and we are left
with a factor $n^{-t/2}$ with $t > 0$.

□ 

Next we show that the (properly normalised) U-statistics $U_n$ converge in moments to a symmetric
polynomial in the canonical variables of the algebra $CCR(L^2_0(\rho))$. Intuitively this follows from
Lemma 6.6 and Corollary 4.4 but, as in the classical case, formulating the result in a coordinate-free
fashion seems to be more difficult. Therefore we will decompose the kernel and the U-statistic with
respect to the natural ortho-symplectic basis of $L^2_0(\rho)$ described in section 4. Recall that $L^2(\rho)$ splits
into 3 orthogonal subspaces

$$L^2(\rho) = \mathbb{C}1 \oplus L^2_0(\rho) = \mathbb{C}1 \oplus \mathcal{H}_\rho^{c} \oplus \mathcal{H}_\rho^{q} ,$$

where $\mathcal{H}_\rho^{c}$ is the 'classical part' consisting of matrices $D$ that commute with $\rho$ and satisfy $\mathrm{Tr}(\rho D) = 0$.
The corresponding canonical variables $\mathbb{G}(D)$ form the centre of $CCR(L^2_0(\rho))$, hence the label
'classical'. The 'quantum part' $\mathcal{H}_\rho^{q}$ has an ortho-symplectic basis $\{t_{j,k} : 1 \le j \neq k \le d\}$ such that
$(Q_{j,k} := \mathbb{G}(t_{j,k}),\ P_{j,k} := \mathbb{G}(t_{k,j}))$ are canonical variables of $d(d-1)/2$ commuting quantum oscillators
indexed by pairs $(j < k)$. For simplicity we denote by $\{F_0 := 1, F_1, \dots, F_{d^2-1}\}$ the basis elements

$$\{1\} \cup \{d_i : i = 1, \dots, d-1\} \cup \{t_{j,k} : 1 \le j \neq k \le d\} \qquad (6.5)$$

arranged in a particular order. Let

$$K := \sum_{i_1,\dots,i_r} k(i_1, \dots, i_r)\, F_{i_1} \otimes \cdots \otimes F_{i_r}$$
be the decomposition of $K$ with $k(i_1, \dots, i_r)$ a symmetric function. Then

$$\begin{aligned}
K_l := P_{\{1,\dots,l\}}(K) &= \sum_{i_1,\dots,i_l \ge 1} k(i_1, \dots, i_l, 0, \dots, 0)\, F_{i_1} \otimes \cdots \otimes F_{i_l} \otimes 1^{\otimes (r-l)} \\
&= \sum_{i_1,\dots,i_l \ge 1} k_l(i_1, \dots, i_l)\, F_{i_1} \otimes \cdots \otimes F_{i_l} \otimes 1^{\otimes (r-l)} \\
&= \sum_{\{i_1,\dots,i_l\}} k_l(i_1, \dots, i_l)\, F_{i_1,\dots,i_l} \otimes 1^{\otimes (r-l)}
\end{aligned}$$

where the last sum is taken over unordered sets $\{i_1, \dots, i_l\}$ with all $i_j \neq 0$, $F_{i_1,\dots,i_l}$ is the corresponding
symmetric kernel and $k_l$ is a symmetric function of the indices.

The unordered set of indices $\{i_1, \dots, i_l\}$ is in one-to-one correspondence with the set of multiplicities
$\mathbf{m} := \{m_1, \dots, m_{d^2-1}\}$ where $m_p = |\{j : i_j = p\}|$, and we will further write $F_{\mathbf{m}} := F_{i_1,\dots,i_l}$ and express
$K_l$ as

$$K_l = \sum_{\mathbf{m}\in\mathcal{M}_l} k_l(\mathbf{m})\, F_{\mathbf{m}} \otimes 1^{\otimes (r-l)} \qquad (6.6)$$

where the sum runs over the set of partitions

$$\mathcal{M}_l := \Big\{ \mathbf{m} = \{m_1, \dots, m_{d^2-1}\} : m_i \ge 0,\ \sum_{i=1}^{d^2-1} m_i = l \Big\} .$$
By inserting into (6.3) we get

$$U_n = \sum_{l=0}^{r} \binom{r}{l} U_{n,l} = \sum_{l=0}^{r} \binom{r}{l} \sum_{\mathbf{m}\in\mathcal{M}_l} k_l(\mathbf{m})\, U_{n,\mathbf{m}} . \qquad (6.7)$$

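Theorem 6.7 (convergence in moments). Let $U_n$ be the U-statistic with kernel $K$ of order $r$ such
that $K_1 = \cdots = K_{c-1} = 0 \neq K_c$ for some $1 \le c \le r$, with $K_l$ given by (6.6). Then the following
convergence in moments holds

$$\lim_{n\to\infty} \mathrm{Tr}\Big( \rho^{\otimes n} \big( n^{c/2} (U_n - \mathbb{E}_\rho(K)) \big)^p \Big) = \varphi(U^p), \qquad p \in \mathbb{N},$$
where $\varphi$ is the Gaussian state (4.1) and $U$ is the polynomial in canonical variables of $CCR(L^2_0(\rho))$

$$U = \binom{r}{c} \sum_{\mathbf{m}\in\mathcal{M}_c} k_c(\mathbf{m})\, S\Big( \prod_i H_{m_i}(\mathbb{G}(F_i)) \Big) . \qquad (6.8)$$

Remark 6.8. Note that the symmetric product in (6.8) is equal to the (usual) product of mutually
commuting operators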

d-l 

j<k i=l 

where each term is associated to an oscillator or a classical normal variable. It is known that a
polynomial in canonical variables may not be essentially selfadjoint on the domain consisting of
finite particle vectors [23]. This is related to the famous Hamburger moment problem [31] and the
fact that the distributions of powers $X^k$, $k \ge 3$, of Gaussian variables $X$ are not uniquely determined by their
moments JEM - 

For classical (commutative) U-statistics the above convergence holds in the stronger sense of
convergence in distribution. In the quantum case this may still be true but it is not clear to us how to
identify the limit distribution. However, in section 7 we will show that convergence in distribution
holds for non-degenerate statistics and for order two kernels.

Proof. With $\theta = \mathbb{E}_\rho(K) = \mathrm{Tr}(\rho^{\otimes r} K)$, let

$$U_n - \theta = \sum_{l=c}^{r} \binom{r}{l} U_{n,l} = \sum_{l=c}^{r} \binom{r}{l} \sum_{\mathbf{m}\in\mathcal{M}_l} k_l(\mathbf{m})\, U_{n,\mathbf{m}}$$

be the decomposition of $U_n$ into simpler U-statistics with kernels $F_{\mathbf{m}}$.

By Lemma 6.6 the scaled components $U_{n,\mathbf{m}}$ can be expressed as polynomials in variables of the form
$\mathbb{F}_n(F_i)$ or $\mathbb{P}_n(B_j)$ having the form

$$\frac{l!}{n^{l/2}} \binom{n}{l} U_{n,\mathbf{m}} = S\Big( \prod_i \mathbb{F}_n(F_i)^{m_i} \Big) + \sum_{t=0}^{l-2} n^{-t/2} P_{t,\mathbf{m}} , \qquad \mathbf{m} \in \mathcal{M}_l ,$$

where we used a shorthand notation for the symmetric product (see Definition 6.5) of fluctuation
operators $\mathbb{F}_n(F_i)$, each appearing with multiplicity $m_i$.

In particular, if $l > c$ then $n^{c/2} U_{n,l}$ has coefficients of order $n^{-(l-c)/2}$ and its contribution to the
moments of $n^{c/2}(U_n - \theta)$ converges to zero as $n \to \infty$. By applying Corollary 4.4 of the quantum
CLT we get

$$\mathrm{Tr}\big( \rho^{\otimes n} ( n^{c/2} (U_n - \theta) )^p \big) \to m(p)$$
where $m(p)$ is the $p$-th moment (with respect to $\varphi$) of the following polynomial

$$\binom{r}{c} \sum_{\mathbf{m}\in\mathcal{M}_c} k_c(\mathbf{m})\, U^{\infty}_{\mathbf{m}} , \qquad U^{\infty}_{\mathbf{m}} := S\Big( \prod_i \mathbb{G}(F_i)^{m_i} \Big) + P_{0,\mathbf{m}} .$$

Here $P_{0,\mathbf{m}}$ is the polynomial of order strictly less than $c$, obtained by replacing the empirical averages
$\mathbb{P}_n(B)$ by the average $\mathrm{Tr}(\rho B)$, and the fluctuations by their canonical variables. Although this could
be computed using the combinatorial arguments employed in Lemma 6.6, we prefer to give a more
transparent derivation based on the following orthogonality argument.

From the construction of the Hoeffding decomposition we know that $U_{n,c}$ is orthogonal in $L^2(\rho^{\otimes n})$
to all U-statistics $U_{n,\mathbf{m}}$ with $|\mathbf{m}| < c$. By appropriate scaling and taking the limit $n \to \infty$ we then
obtain that the limit polynomial is orthogonal in $L^2(\varphi)$ to all $U^{\infty}_{\mathbf{m}}$ with $|\mathbf{m}| < c$.

We prove by induction with respect to $c$ that

$$U^{\infty}_{\mathbf{m}} = S\Big( \prod_i H_{m_i}(\mathbb{G}(F_i)) \Big), \qquad \forall\, |\mathbf{m}| \le c. \qquad (6.9)$$

If $c = 1$ then the U-statistics are proportional to fluctuation operators and by the CLT we have that
$U^{\infty}_{\mathbf{m}}$ are just the canonical variables, which together with the identity span the space of polynomials
of order 1. Suppose that the induction hypothesis is true for $c - 1$ and let $\mathbf{m}$ be a multiplicity with
$|\mathbf{m}| = c$. Since $U^{\infty}_{\mathbf{m}}$ is orthogonal to all $U^{\infty}_{\mathbf{m}'}$ with $|\mathbf{m}'| < c$, it is orthogonal to all polynomials of
order less than $c$. This follows from the fact that the symmetric products of Hermite polynomials
of order less than $c$ span the set $\mathcal{V}_{c-1}$ of all polynomials of order less than $c$.

Let $\Pi^{\perp}_{c-1}$ denote the projection onto the orthogonal complement of $\mathcal{V}_{c-1}$. Then

$$U^{\infty}_{\mathbf{m}} = \Pi^{\perp}_{c-1} U^{\infty}_{\mathbf{m}} = \Pi^{\perp}_{c-1} S\Big( \prod_i \mathbb{G}(F_i)^{m_i} \Big)$$

since $P_{0,\mathbf{m}} \in \mathcal{V}_{c-1}$. Using Corollary 4.6 we obtain

$$U^{\infty}_{\mathbf{m}} = \Pi^{\perp}_{c-1} S\Big( \prod_i \mathbb{G}(F_i)^{m_i} \Big) = S\Big( \prod_i H_{m_i}(\mathbb{G}(F_i)) \Big)$$

which ends the proof of the induction hypothesis.

□ 
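The orthogonality mechanism behind the induction is already visible for a single classical Gaussian variable: the probabilists' Hermite polynomial $H_c$ is what remains of the monomial $x^c$ after removing its components along polynomials of lower order. The following Python sketch illustrates only this one-variable commutative case, as an assumption-laden stand-in for the non-commutative statement; it uses NumPy's `hermite_e` module with Gauss quadrature for the standard normal law.

```python
import numpy as np
from math import factorial
from numpy.polynomial import hermite_e as He

# Gauss quadrature for the weight exp(-x^2/2), normalised to N(0, 1).
nodes, weights = He.hermegauss(40)
weights = weights / np.sqrt(2 * np.pi)


def hermite(k, x):
    """Probabilists' Hermite polynomial He_k evaluated at x."""
    coef = np.zeros(k + 1)
    coef[k] = 1.0
    return He.hermeval(x, coef)


c = 4
# Gram matrix E[He_j(X) He_k(X)] for X ~ N(0, 1); expected: diag(k!).
gram = np.array([[np.sum(weights * hermite(j, nodes) * hermite(k, nodes))
                  for k in range(c + 1)] for j in range(c + 1)])
expected = np.diag([float(factorial(k)) for k in range(c + 1)])

# Expansion of the monomial x^c in the Hermite basis.
mono = np.zeros(c + 1)
mono[c] = 1.0
coef = He.poly2herme(mono)

# Removing the components of order < c leaves exactly He_c.
residual = coef.copy()
residual[:c] = 0.0

print(np.round(gram, 8))
print(coef, residual)
```

For $c = 4$ the expansion reads $x^4 = He_4 + 6\,He_2 + 3$, so the part orthogonal to all polynomials of order less than four is $He_4$ itself.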



7 Non-degenerate and second order kernels 

In this section we show that Theorem 6.7 can be strengthened in some cases of particular interest
to applications: non-degenerate kernels, and kernels of order 2.

Corollary 7.1. A kernel $K$ of order $r$ is called non-degenerate if $K_1 \neq 0$. In this case the associated
U-statistic is asymptotically normal:

$$\sqrt{n}\,(U_n - \theta) \xrightarrow{\ \mathcal{D}\ } N(0, r^2 \xi_1), \qquad \xi_1 = \mathrm{Tr}(\rho K_1^2).$$

Proof. The convergence in moments to $N(0, r^2 \xi_1)$ follows from Theorem 6.7. Since the limit
distribution is uniquely determined by its moments, this implies that $\sqrt{n}\,(U_n - \theta)$ converges in distribution
to $N(0, r^2 \xi_1)$ (cf. 0). □

Theorem 7.2. Let $K := \theta 1 + \sum_{i,j>0} k_{i,j}\, F_i \otimes F_j$ be a degenerate kernel of order 2. Then the
associated U-statistic is given by

$$U_n - \theta = \frac{1}{n-1} \sum_{i,j>0} k_{i,j} \big( \mathbb{F}_n(F_i) \circ \mathbb{F}_n(F_j) - \mathbb{P}_n(F_i \circ F_j) \big),$$

and

$$(n-1)(U_n - \theta) \xrightarrow{\ \mathcal{D}\ } U := \sum_{i,j>0} k_{i,j} \big( \mathbb{G}(F_i) \circ \mathbb{G}(F_j) - \delta_{i,j} 1 \big),$$
where the operator $U$ is selfadjoint.

Proof. The form of $U_n$ can be derived as in Lemma 6.6 and the convergence in moments can be
deduced from Theorem 6.7.
It is well known that in the case of a CCR algebra over a finite dimensional space, any quadratic
form in canonical variables is selfadjoint, and its distribution is uniquely determined by its moments.
We give here a sketch of the proof and refer to [28] for more details. By Nelson's analytic vector
Theorem, the selfadjointness follows from the fact that finite particle vectors are analytic for $U$, i.e.
satisfy $\sum_p \|U^p \psi\|\, t^p / p! < \infty$ for some $t > 0$. This can be shown by using the inequality

$$\|A_1 \cdots A_p\, \psi_k\| \le \sqrt{k+1}\, \sqrt{k+2} \cdots \sqrt{k+p}$$

where $A_i$ represents a creation or annihilation operator and $\psi_k$ is a vector of $k$ excitations.

To prove uniqueness of the distribution it is enough to show that the moments $m_p := \varphi(U^p)$ satisfy
$|m_p| \le C D^p p!$. The latter follows from the above inequality with $\psi_k$ being the vacuum vector in an
appropriate representation of the CCR algebra.

□ 

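In the next two examples we consider order 2 kernels on $\mathbb{C}^2$ with reference state given by the density
matrix

$$\rho = \begin{pmatrix} \lambda & 0 \\ 0 & 1-\lambda \end{pmatrix},$$

so that $\{1, \sigma_x, \sigma_y, \sigma_z - (2\lambda - 1)1\}$ forms an orthogonal basis of $L^2(\rho)$, where $\sigma_x, \sigma_y, \sigma_z$ are the Pauli
matrices

$$\sigma_x = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad
\sigma_y = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \qquad
\sigma_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}.$$

Example 7.3. Let $K$ be the symmetric kernel of order 2 given by $K = \tfrac12(\sigma_x \otimes \sigma_y + \sigma_y \otimes \sigma_x)$. The
associated U-statistic is

$$\frac{2}{n} \binom{n}{2} U_n = \frac{1}{n} \sum_{i \neq j} \sigma_x^{(i)} \sigma_y^{(j)} = \mathbb{F}_n(\sigma_x) \circ \mathbb{F}_n(\sigma_y)$$

and by Theorem 7.2 it converges in distribution to the law of $U := 2(2\lambda - 1)\, Q \circ P$ with respect to
the thermal equilibrium state $\varphi_\sigma$ with variance $\sigma^2 = (2(2\lambda - 1))^{-1}$.

Example 7.4. Let $H$ be the symmetric kernel of order 2 given by $H = \sigma_x \otimes \sigma_x + \sigma_y \otimes \sigma_y$. The
associated U-statistic is

$$\frac{2}{n} \binom{n}{2} U_n = \frac{1}{n} \sum_{i \neq j} \big( \sigma_x^{(i)} \sigma_x^{(j)} + \sigma_y^{(i)} \sigma_y^{(j)} \big) = \mathbb{F}_n(\sigma_x)^2 + \mathbb{F}_n(\sigma_y)^2 - 2 \cdot 1$$
and converges in distribution to

$$2(2\lambda - 1)(Q^2 + P^2) - 2 \cdot 1 = 4(2\lambda - 1)(N - \mathbb{E}(N) 1).$$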
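The finite-$n$ operator identity of Example 7.4 can be confirmed directly. The sketch below assumes $n = 4$ qubits and uses $\mathrm{Tr}(\rho\sigma_x) = \mathrm{Tr}(\rho\sigma_y) = 0$ for a diagonal state, so that the fluctuation operators are $n^{-1/2}\sum_i \sigma^{(i)}$; the helper names are illustrative.

```python
import numpy as np
from itertools import combinations
from math import comb

n = 4
sx = np.array([[0, 1], [1, 0]], dtype=complex)
sy = np.array([[0, -1j], [1j, 0]], dtype=complex)


def site_op(a, i):
    """The matrix a acting on qubit i (0-based) of n qubits."""
    out = np.eye(1)
    for k in range(n):
        out = np.kron(out, a if k == i else np.eye(2))
    return out


def F(a):
    """Fluctuation operator of an observable with zero mean."""
    return sum(site_op(a, i) for i in range(n)) / np.sqrt(n)


# U-statistic with kernel H = sx (x) sx + sy (x) sy.
U_n = sum(site_op(sx, i) @ site_op(sx, j) + site_op(sy, i) @ site_op(sy, j)
          for i, j in combinations(range(n), 2)) / comb(n, 2)

lhs = (n - 1) * U_n                 # equals (2/n) * binom(n, 2) * U_n
rhs = F(sx) @ F(sx) + F(sy) @ F(sy) - 2 * np.eye(2 ** n)
err = np.abs(lhs - rhs).max()

print(err)
```

The constant $-2\cdot 1$ comes from the diagonal terms $i = j$ of the squares, since $\sigma_x^2 = \sigma_y^2 = 1$.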



8 Applications to quantum statistics 

In this section we sketch a couple of quantum statistical applications of the limit theorems for 
degenerate order two kernels. A more detailed analysis will be pursued in a separate publication. 
8.1 Testing for a particular state

Let $\rho \in M(\mathbb{C}^d)$ be a fixed diagonal density matrix with distinct eigenvalues $\lambda_1 > \cdots > \lambda_d$ (cf. (4.2)).
Given $n$ i.i.d. samples from an unknown state $\sigma \in M(\mathbb{C}^d)$, we would like to test whether $\sigma$ is equal to
$\rho$. Roughly speaking we would like to distinguish the following hypotheses

$$H_0 : \sigma = \rho, \qquad H_1 : \sigma \neq \rho,$$

and we ignore here the obvious difficulties related to the fact that the two sets of states are not
'separated'.

The test $t_n$ is a binary measurement with outcomes in $\{0, 1\}$. When the result $i$ is obtained, we
accept the hypothesis $H_i$. We denote the corresponding POVM elements by $T_n$ and $1 - T_n$ such
that

$$\mathbb{P}(t_n = 0 \mid \sigma) = \mathrm{Tr}(\sigma^{\otimes n} T_n), \qquad \mathbb{P}(t_n = 1 \mid \sigma) = \mathrm{Tr}(\sigma^{\otimes n} (1 - T_n)).$$
Adopting the standard statistical set-up [21] we define the type I error probability of the test $t_n$ by

$$\alpha_n := \mathbb{P}(t_n = 1 \mid \sigma = \rho) = \mathrm{Tr}(\rho^{\otimes n} (1 - T_n)).$$

The type II error probability of $t_n$ at a point $\sigma \neq \rho$ is

$$\beta_n(\sigma) := \mathbb{P}(t_n = 0 \mid \sigma) = \mathrm{Tr}(\sigma^{\otimes n} T_n).$$

Let $0 < \alpha < 1$ be a fixed 'significance level'. We say that $t_n$ is a level-$\alpha$ test if $\alpha_n \le \alpha$. Among such
level-$\alpha$ tests we would like to identify those which have the smallest type II error probability
$\beta_n(\sigma)$. In general a 'uniformly most powerful' test over all $\sigma$ may not exist but sometimes one can
find an optimal one in a restricted class of tests such as unbiased, covariant, etc.

Here we propose a test based on a U-statistic with kernel

$$K = \sum_{i=1}^{d} (\lambda_i 1 - E_{i,i}) \otimes (\lambda_i 1 - E_{i,i}) + \sum_{j \neq k = 1}^{d} T_{j,k} \otimes T_{j,k} ,$$

where $T_{j,k}$ are the selfadjoint matrices defined in (4.4). Then $K$ gives an unbiased estimator of
$\|\sigma - \rho\|_2^2$, i.e.

$$\mathrm{Tr}(\sigma^{\otimes 2} K) = \|\sigma - \rho\|_2^2 =: \theta .$$

From its expression we see that $U_n$ is degenerate at the state $\rho$, that is $K_1 = 0$. By applying Theorem
7.2 we get

d 

nU n A U := £(G? - A*(l - A*)l) + ^(0?,* + p h ~ 2 °h)H* 

i=l j<k 

where $\{G_i\}$ are centred Gaussian variables with covariance matrix $V_{ij}$ and $\{Q_{j,k}, P_{j,k}\}$ are canonical
coordinates of $d(d-1)/2$ quantum oscillators prepared independently in thermal states $\varphi_{j,k}$ with
variance (4.6). Let $I(\alpha) = [a, b]$ be an $\alpha$-significance interval for $U$, i.e. $\mathbb{P}(U \notin [a, b]) = \alpha$.
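The unbiasedness and degeneracy of the kernel can be checked numerically. The sketch below assumes $d = 3$, a particular diagonal $\rho$, and the normalisation $T_{j,k} = (E_{j,k} + E_{k,j})/\sqrt{2}$ for $j < k$ and $T_{j,k} = i(E_{j,k} - E_{k,j})/\sqrt{2}$ for $j > k$; the definition (4.4) is not reproduced in this section, so this normalisation is an assumption. Under it the kernel coincides with $\mathrm{SWAP} - \rho \otimes 1 - 1 \otimes \rho + \mathrm{Tr}(\rho^2) 1$, which makes the identity $\mathrm{Tr}(\sigma^{\otimes 2} K) = \|\sigma - \rho\|_2^2$ transparent.

```python
import numpy as np

d = 3
lam = np.array([0.5, 0.3, 0.2])      # assumed eigenvalues of rho
rho = np.diag(lam)
I = np.eye(d)


def unit(j, k):
    e = np.zeros((d, d), dtype=complex)
    e[j, k] = 1.0
    return e


def T(j, k):
    """Assumed normalisation of the selfadjoint matrices T_{j,k}."""
    if j < k:
        return (unit(j, k) + unit(k, j)) / np.sqrt(2)
    return 1j * (unit(j, k) - unit(k, j)) / np.sqrt(2)


K_paper = sum(np.kron(lam[i] * I - unit(i, i), lam[i] * I - unit(i, i))
              for i in range(d))
K_paper = K_paper + sum(np.kron(T(j, k), T(j, k))
                        for j in range(d) for k in range(d) if j != k)

swap = sum(np.kron(unit(j, k), unit(k, j))
           for j in range(d) for k in range(d))
K_swap = (swap - np.kron(rho, I) - np.kron(I, rho)
          + (lam @ lam) * np.eye(d * d))

# Unbiasedness at a randomly drawn alternative state sigma.
rng = np.random.default_rng(4)
x = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
sigma = x @ x.conj().T
sigma /= np.trace(sigma).real
theta = np.trace(np.kron(sigma, sigma) @ K_paper).real
dist2 = np.linalg.norm(sigma - rho) ** 2      # squared Hilbert-Schmidt norm

# Degeneracy at rho: K_1 = Tr_2[K (1 (x) rho)] vanishes (theta = 0 there).
M = (K_paper @ np.kron(I, rho)).reshape(d, d, d, d)
K1 = np.einsum('abcb->ac', M)

print(theta, dist2, np.abs(K1).max())
```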

We devise the following test: measure $U_n$ to obtain a random result $u_n$ and then accept $H_0$ if
$n u_n \in I(\alpha)$ and $H_1$ if $n u_n$ is in the 'critical region' $I(\alpha)^c$. The corresponding POVM element
$T_n$ is the spectral projection of $n U_n$ corresponding to the interval $I(\alpha)$. By construction this is an
asymptotically level $\alpha$ test, and its type II error at a fixed $\sigma \neq \rho$ can be shown to go exponentially
fast to zero by considering the asymptotic distribution of $U_n$ under $\sigma^{\otimes n}$. In this case the statistic is
non-degenerate and we have

$$\sqrt{n}\,(U_n - \|\sigma - \rho\|_2^2) \xrightarrow{\ \mathcal{D}\ } N(0, V)$$

for some variance $V = V(\rho, \sigma)$. In other words $n U_n \approx n \|\sigma - \rho\|_2^2 + \sqrt{n}\, N(0, V)$ and

$$\beta_n = \mathbb{P}_\sigma(n u_n \in I(\alpha)) = O\big( e^{-n \|\sigma - \rho\|_2^4 / 2V} \big)$$

which means that the test is asymptotically consistent. A more detailed analysis of this test and its
'local limiting power' [33] (when the alternative $\sigma$ converges to the null hypothesis $\rho$ as $n \to \infty$)
will be pursued in a separate publication.

8.2 Homogeneity test 

We are given $n$ independent systems prepared in a state $\sigma_1 \in M(\mathbb{C}^d)$ and another $n$-tuple of
independent systems with state $\sigma_2 \in M(\mathbb{C}^d)$, independent of the first ones. We would like to
decide whether $\sigma_1 = \sigma_2$, i.e.

$$H_0 := \{\sigma_1 = \sigma_2\}, \qquad H_1 := \{\sigma_1 \neq \sigma_2\}.$$

The test is based on a U-statistic which gives an unbiased estimator of the distance $\|\sigma_1 - \sigma_2\|_2^2$. Let
$K$ be the second order kernel on $\mathcal{H} = \mathbb{C}^d \otimes \mathbb{C}^d$

$$K := \sum_{i} (E_{i,i} \otimes 1 - 1 \otimes E_{i,i}) \otimes (E_{i,i} \otimes 1 - 1 \otimes E_{i,i}) + \sum_{j \neq k} (T_{j,k} \otimes 1 - 1 \otimes T_{j,k}) \otimes (T_{j,k} \otimes 1 - 1 \otimes T_{j,k}).$$
Then

$$\mathrm{Tr}\big( (\sigma_1 \otimes \sigma_2)^{\otimes 2} K \big) = \|\sigma_1 - \sigma_2\|_2^2 ,$$

and as in the previous example the U-statistic can be used as a test with critical region chosen
according to the distribution of the limit quadratic form $U$.



8.3 Quantum metrology 

The capacity to prepare and control quantum systems opens new possibilities for precision metrology 
with applications ranging from time and frequency standards to magnetometry and gravitational 
waves detection. Beating the standard quantum limit typically requires either special quantum 
states of many body systems (e.g. squeezed, or noon states) or special interactions generating the 
quantum dynamics [H [7J [29]. We discuss the latter case here in connection with the theory of 
quantum U-statistics developed in this paper. Let $K$ be a kernel of order $r$ and let

$$H_n = \sum_{\beta} K^{(\beta)} = \binom{n}{r} U_n$$
be an $r$-body interaction hamiltonian on $n$ systems, generating a unitary evolution $V_t^{\gamma,n} := \exp(it\gamma H_n)$
where the coupling constant $\gamma$ is an unknown parameter that we want to estimate. To do this, we
let an initial state $\psi_0$ evolve for some time and then perform an appropriate measurement on the
final state $\psi_t^{\gamma,n} := V_t^{\gamma,n} \psi_0$ to obtain an estimator $\hat\gamma_n$. In the special case when $K = h \otimes \cdots \otimes h$ with
$h$ a fixed selfadjoint operator, the hamiltonian can be diagonalised explicitly and an argument based
on the quantum Cramér-Rao inequality [H] suggests that the mean square error of $\hat\gamma_n$ may attain the
rate $n^{-2r}$, for special 'noon-type' initial states. However, since these states are hard to engineer, it is
important to know whether similar rates can be achieved with product states $\psi_0 = \phi \otimes \cdots \otimes \phi$. This
is discussed in [7] where it is shown that the slightly worse rate $n^{-2r+1}$ is achievable for some initial
states, and the optimal estimation procedure consists of a sequence of finer and finer estimates, with
separate measurements performed at each stage.

Without entering into the details, we suggest that the convergence theory of quantum U-statistics
offers a general set-up for extending these results to generic $r$-body hamiltonians, mixed initial states,
and for obtaining the limit distribution of the estimators. Indeed, suppose that $\gamma$ is of order $n^{-r+1/2}$
so that $\gamma = n^{-r+1/2} g$, and suppose that $\mathrm{Tr}(\rho_0^{\otimes r} K) = 0$ where $\rho_0 = |\phi\rangle\langle\phi|$ is the initial state of one
constituent. Then by Corollary 7.1 (applied to a pure state) we have

$$\begin{aligned}
\lim_{n\to\infty} \langle \psi_t^{\gamma_1,n} | \psi_t^{\gamma_2,n} \rangle &= \lim_{n\to\infty} \mathrm{Tr}\big( \rho_0^{\otimes n} \exp( it(g_1 - g_2)\, n^{-r+1/2} H_n ) \big) \\
&= \mathbb{E}\big[ \exp( it(g_1 - g_2) U / r! ) \big] = \exp\big( -t^2 (g_1 - g_2)^2 \xi_1 / 2((r-1)!)^2 \big) \\
&= \langle \alpha_1 | \alpha_2 \rangle
\end{aligned}$$

where $U \sim N(0, r^2 \xi_1)$, $\xi_1 = \mathrm{Tr}(\rho_0 K_1^2)$ and $|\alpha_i\rangle$ are coherent states with displacement
$\alpha_i = t g_i \sqrt{\xi_1} / (r-1)!$. This can be interpreted as saying that in a neighbourhood of parameters $\gamma$ of size
$n^{-r+1/2}$, the output states $\psi_t^{\gamma,n}$ can be approximated by coherent states and the parameter can be estimated
by a simple homodyne measurement. However, to do this one needs to have already narrowed the
parameter down to the desired accuracy range! A way out could be to follow the philosophy of
local asymptotic normality [13] [20] [12] employed for optimal state estimation, and show that the
above convergence holds uniformly over a range of parameters $\gamma$ which is larger than $n^{-r+1/2}$, more
precisely $n^{-r+1/2+\epsilon}$ for $\epsilon > 0$. Taking advantage of this, one could construct a sequence of finer and
finer estimators with progressively narrower Gaussian distributions.
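The Gaussian limit of the overlap is easy to see in the simplest case $r = 1$, where $H_n = \sum_i h^{(i)}$, $(r-1)! = 1$ and the overlap of two product output states factorises into an $n$-th power. The Python sketch below is a minimal illustration under assumed choices, namely $h = \sigma_x$ and $\phi = |0\rangle$ (so that $\langle\phi|h|\phi\rangle = 0$ and $\xi_1 = 1$), with arbitrary values of $t$, $g_1$, $g_2$; the function name `single_overlap` is hypothetical.

```python
import numpy as np


def single_overlap(h, phi, s):
    """<phi| exp(i s h) |phi> computed from the spectral decomposition of h."""
    w, v = np.linalg.eigh(h)
    amp = v.conj().T @ phi
    return np.sum(np.abs(amp) ** 2 * np.exp(1j * s * w))


h = np.array([[0, 1], [1, 0]], dtype=complex)   # assumed one-body generator
phi = np.array([1, 0], dtype=complex)           # assumed initial state |0>

mean_h = (phi.conj() @ h @ phi).real            # must vanish (centred kernel)
xi1 = (phi.conj() @ h @ h @ phi).real           # xi_1 = <phi| h^2 |phi>

t, g1, g2 = 1.0, 1.3, 0.3
s = t * (g1 - g2)

n = 10_000
# gamma = g / sqrt(n) for r = 1, so each factor carries the phase s / sqrt(n).
overlap_n = single_overlap(h, phi, s / np.sqrt(n)) ** n
gaussian_limit = np.exp(-s ** 2 * xi1 / 2)

print(overlap_n, gaussian_limit)
```

For this choice the single-site factor is $\cos(s/\sqrt{n})$, and its $n$-th power approaches $e^{-s^2/2}$, in agreement with the limit formula specialised to $r = 1$.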



9 Conclusions and outlook 

Symmetric operators on tensor product spaces can be seen as non-commutative functions of collective
observables. U-statistics are special examples of symmetric operators built from a single kernel $K$
acting on a fixed number of systems. For i.i.d. states, the rescaled U-statistics converge in moments
to certain polynomials in the canonical variables of the CCR algebra arising from the Central Limit
Theorem. In some cases the convergence can be upgraded to convergence in distribution but it
remains an open question whether this can be done in general. This is closely related to the
problem of finding self-adjoint extensions of polynomials in canonical variables.

As sketched in the previous section, U-statistics may be employed in different statistical problems,
notably testing with non-standard hypotheses and precision metrology beyond the Heisenberg limit.
We expect that in order to solve these problems, one needs to further develop the theory of quantum
U-statistics. For example, achieving the rate $n^{-r+1/2}$ in parameter estimation requires the control
of the rate of convergence to normality in Corollary 7.1. This is a classical topic in probability and
statistics [Ml S] going back to the Berry-Esseen bounds for convergence rate in CLT [5J [TU] . 

Acknowledgements. M.G. would like to thank Gerardo Adesso for fruitful discussions and for pointing 
out the reference [7]. M.G. was supported by the EPSRC Fellowship EP/E052290/1. 